Universality proof and analysis of generalized nested Uhrig dynamical decoupling 
Nested Uhrig dynamical decoupling (NUDD) is a highly efficient quantum error suppression 
scheme that builds on optimized single axis UDD sequences. We prove the universality of NUDD 
and analyze its suppression of different error types in the setting of generalized control pulses. We 
present an explicit lower bound for the decoupling order of each error type, which we relate to the 
sequence orders of the nested UDD layers. We find that the error suppression capabilities of NUDD 
are strongly dependent on the parities and relative magnitudes of all nested UDD sequence orders. 
This allows us to predict the optimal arrangement of sequence orders. We test and confirm our 
analysis using numerical simulations. 



I. INTRODUCTION 

One of the main obstacles in building a quantum com- 
puter is the inevitable coupling between a quantum sys- 
tem and its environment, or bath, which typically results 
in decoherence and leads to computational errors [1–3]. 
Adapted from nuclear magnetic resonance (NMR) refocusing
techniques [4, 5], dynamical decoupling (DD) [6] is 
a powerful open-loop technique that can be used to sup- 
press decoherence by applying a sequence of short and 
strong pulses purely on the system to mitigate unwanted 
system-bath interactions. 

Early schemes such as periodic DD (PDD) [7–11], using equidistant π-pulses, have been shown to suppress general system-bath interactions up to first order in time-dependent perturbation theory, with respect to the total sequence duration $T$, thereby achieving first-order decoupling. Concatenated DD (CDD) [12, 13], which recursively embeds PDD into itself, was the first explicit scheme capable of achieving $N$th-order decoupling for general single-qubit decoherence, i.e., complete error suppression to order $T^N$, and has been amply tested in recent experimental studies [14–18]. However, the number of pulses required by CDD grows exponentially with decoupling order: $4^N$ pulses are required to attain $N$th-order error suppression, which is not efficient enough to implement scalable quantum computing when $N$ is large. For scalable quantum computing to be possible, it is desirable to design a DD sequence which is both accurate (high decoupling order) and efficient (small number of pulses). 

For the single-qubit pure dephasing spin-boson model, an optimal scheme called Uhrig DD (UDD) [19, 20] achieves $N$th-order decoupling with the smallest possible number $\bar N$ of ideal π-pulses, applied at non-equidistant pulse timings,
\begin{equation}
t_j = T \sin^2\frac{j\pi}{2N+2}, \qquad j = 1, 2, \dots, \bar N, \tag{1}
\end{equation}
where $\bar N = N$ or $N+1$ depending on whether $N$ is even or odd. $N$ is also referred to as the sequence order. The applicability of UDD extends beyond the spin-boson model to models with pure dephasing interactions with generic bounded baths [21, 22], rendering it model-independent, or "universal". Moreover, it has also been proven to be applicable to analytically time-dependent Hamiltonians [23]. Extensive numerical and experimental studies of UDD performance can be found in [15, 16, 24–33]. Rigorous performance bounds for UDD were found in [22], and for QDD and NUDD in [34]. 
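As a small illustration of the UDD pulse timings, the following sketch evaluates the standard rule $t_j = T\sin^2[j\pi/(2N+2)]$ and applies the parity rule for the number of pulses described above. The function name `udd_pulse_times` and its arguments are hypothetical, chosen only for this example.

```python
import math


def udd_pulse_times(order, total_time=1.0):
    """Pulse times t_j = T * sin^2(j*pi / (2N + 2)) of a UDD sequence of order N.

    The number of pulses is N for even N and N + 1 for odd N, so that the
    total number of pulses is always even (the last pulse then falls at t = T).
    """
    n_pulses = order if order % 2 == 0 else order + 1
    return [
        total_time * math.sin(j * math.pi / (2 * order + 2)) ** 2
        for j in range(1, n_pulses + 1)
    ]
```

For an even order the last pulse lies strictly inside the interval, while for an odd order the additional pulse coincides with the end of the sequence.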

While UDD is applicable to single- and two-axis in- 
teractions (e.g., pure dephasing and/or pure bit-flip), 
it cannot overcome general, three-axis single-qubit decoherence.
Quadratic DD (QDD) [35] and its generalization
Nested UDD (NUDD) [36], which were proposed 
to tackle general single and multi-qubit decoherence re- 
spectively, exploit the decoupling efficiency of UDD by 
nesting two- and multi-layer UDD, where sequence or- 
ders of different UDD layers can be different in order to 
address the dominant sources of error more efficiently. In 
[36], NUDD (including QDD) with even sequence orders 
on the inner levels was verified analytically to achieve 
arbitrary order decoupling with only a polynomial in- 
crease in the number of pulses over UDD, an exponential 
improvement over CDD or CUDD [37] which combines 
orthogonal single-axis CDD and UDD sequences. A uni- 
versality proof of NUDD with arbitrary sequence orders 
was given in [38]. The same result, for QDD with arbitrary
sequence orders, was proved independently in our 
previous work [39], using a different method. Our uni- 
versality proof for QDD [35] went beyond quantifying 
the overall performance of QDD, in that we provided a 
thorough analytical study of the suppression abilities of 
the inner- and outer-layer UDD sequences of QDD for 
each single-axis error, and obtained the decoupling order 
of each single-axis error. We found the decoupling order 
dependence on the parities and relative magnitudes of the 
inner and outer sequence orders. However, the analysis 
for the performance of NUDD with arbitrary sequence 
orders on each individual error is still missing. A further 
restriction concerns the type of control pulses, which in 
general QDD or NUDD proofs [38, 39] have so far been 
limited to elements of SU(2) or tensor products thereof. 
Numerical studies of QDD and NUDD performance are 
given in (3J1 M^M 



In this paper, we give a rigorous and compact proof 
for the universality and performance of NUDD (includ- 
ing QDD) with arbitrary sequence orders. The set of 
control pulse types is generalized to the mutually or- 
thogonal operation set (MOOS) defined in [36] . The 
concept of error types is also generalized, in the sense 
that errors are classified according to the types of con- 
trol pulses chosen. Most importantly, we obtain the ex- 
plicit decoupling order formula, a function of the given 
error type, and the parities and magnitudes of all the 
sequence orders of NUDD. This formula shows explicitly 
how each UDD layer contributes to the suppression of a 
given type of error. An immediate consequence is that 
the overall suppression order of NUDD with a MOOS 
as the control pulse set is the minimum among all se- 
quence orders of NUDD. Moreover, our analysis iden- 
tifies the condition under which the suppression ability 
of a given UDD layer is being hindered, or rendered to- 
tally ineffective, or enhanced by other UDD layers with 
odd sequence orders. One can thus design an NUDD 
scheme such that the full power of each UDD layer is 
exploited. Note that our proof also shows that the 
performance of NUDD schemes with generalized control 
pulse types and arbitrary sequence orders is universal, as 
in previous UDD-like schemes with Pauli group elements 
as control pulses [HJ [H (Ml GH [39]. In other words, 
the performance of a general NUDD sequence remains the 
same for multi-qubit or multi-level systems coupled to 
arbitrary bounded environments. We present numerical 
simulations in support of our analytical results for the de- 
coupling order of each error type, for a four-layer NUDD 
scheme applied to a two-qubit system. 

The structure of this paper is as follows. In Sec. II, a general NUDD scheme with a MOOS as the control pulse set is formulated. In Sec. III, the results on the performance of NUDD are presented. Specifically, we present the NUDD Theorem (Theorem 1), which gives the decoupling order formula for each error type. Corollary 1, along with its proof, gives the overall performance of NUDD. The complete proof of the NUDD Theorem is presented in Sec. IV. Numerical results for a 4-layer NUDD scheme on a 2-qubit system are given in Sec. V in support of our analysis. We conclude in Sec. VI. The appendixes provide additional technical details. 



II. NUDD FORMULATION 

A. The noise model 

We assume a completely general noise Hamiltonian $H$ acting on the joint system-bath Hilbert space, the only assumption being that $\|H\| < \infty$. We allow for arbitrary interactions between the system and the bath, as well as between different parts of the system or between different parts of the bath. We use
\begin{equation}
\|A\| \equiv \sup_{|\psi\rangle} \frac{\|A|\psi\rangle\|}{\||\psi\rangle\|} \tag{2}
\end{equation}
to denote the sup-operator norm of any operator $A$, i.e., the largest singular value of $A$, or the largest eigenvalue (in absolute value) if $A$ is Hermitian. 



B. NUDD pulse timing 

A general $\ell$-layer NUDD scheme with a sequence order set $\{N_1, N_2, \dots, N_\ell\}$ is constructed by concatenating $\ell$ levels of UDD sequences, where $N_i$ is the sequence order of the UDD$_{N_i}$ sequence at the $i$th level of NUDD. The sequence orders of different UDD layers can assume different values in order to address the dominant sources of error in any particular implementation more efficiently. The control pulse operator set $\{\Omega_1, \Omega_2, \dots, \Omega_\ell\}$, where the subscript $i$ is the layer index, is chosen to be the mutually orthogonal operation set (MOOS) defined in [36], which consists of independent, mutually commuting or anticommuting system operators, each of which is both unitary and Hermitian. Obviously, each $\Omega_i$ is required not to commute with the total Hamiltonian, for otherwise it would not have any effect on the noise. Note that the MOOS elements $\Omega_i$ are not restricted to be single-qubit system operators such as a single-qubit Pauli matrix. 
The normalized $\ell$-layer NUDD pulse timing $\eta_{j_\ell, j_{\ell-1}, \dots, j_1}$ is defined as the actual NUDD timing, divided by the total evolution time $T$, where $j_i \in \{1, \dots, N_i + 1\}$ is called the $i$th-layer UDD pulse timing index. With fixed $\{j_k\}_{k=i+1}^{\ell}$ and $\{j_k = N_k + 1\}_{k=1}^{i-1}$, the $\eta_{j_\ell, j_{\ell-1}, \dots, j_i}$ with $j_i$ running from 1 to $N_i + 1$ constitute one cycle of UDD$_{N_i}$, of total duration $s_{j_\ell} s_{j_{\ell-1}} \cdots s_{j_{i+1}}$,
\begin{equation}
\eta_{j_\ell,\dots,j_{i+1},j_i} = \eta_{j_\ell,\dots,j_{i+1},0} + s_{j_\ell} s_{j_{\ell-1}} \cdots s_{j_{i+1}} \sin^2\frac{j_i\pi}{2N_i+2}, \tag{3}
\end{equation}
where
\begin{align}
\eta_{j_\ell,\dots,j_{i+1},j_i} &\equiv \eta_{j_\ell,\dots,j_{i+1},j_i,N_{i-1}+1,\dots,N_1+1} \notag\\
&= \eta_{j_\ell,\dots,j_{i+1},j_i+1,0,\dots,0} \tag{4}
\end{align}
with $\eta_{j_\ell=0} = 0$, and
\begin{equation}
s_{j_k} = \sin\frac{\pi}{2N_k+2}\,\sin\frac{(2j_k-1)\pi}{2N_k+2} \tag{5}
\end{equation}
is the $j_k$th pulse interval of the normalized UDD$_{N_k}$ sequence. 

Accordingly, the $i$th-level $\Omega_i$ pulses are applied at the timings $T\eta_{j_\ell,\dots,j_i}$ with $j_i = 1, 2, \dots, \bar N_i$, where $\bar N_i = N_i$ if $N_i$ is even while $\bar N_i = N_i + 1$ if $N_i$ is odd, and $\{j_k \in \{1, \dots, N_k+1\}\}_{k=i+1}^{\ell}$. The additional pulse applied at the end of the sequence when $N_i$ is odd is required in order to make the total number of $\Omega_i$ pulses even, so that the overall effect of the $\Omega_i$ pulses at the final time $T$ will be to leave the qubit state unchanged [35, 39]. 
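The nested construction of the pulse timings can be sketched in a few lines. The code below is only an illustration under the assumptions stated in its comments: layers are listed innermost first, each interval of an outer UDD layer contains one rescaled cycle of the next inner layer, and a layer of odd order receives one extra pulse at the end of each of its cycles. The names `udd_nodes` and `nudd_pulse_times` are hypothetical.

```python
import math


def udd_nodes(order):
    """Normalized UDD nodes sin^2(j*pi/(2N+2)), j = 0..N+1 (end points included)."""
    return [math.sin(j * math.pi / (2 * order + 2)) ** 2 for j in range(order + 2)]


def nudd_pulse_times(orders):
    """Normalized pulse times for sequence orders (N_1, ..., N_l), innermost first.

    Returns a dict mapping the 1-based layer index i to the time-ordered list
    of normalized times at which the layer-i control pulse is applied.
    """
    pulses = {i: [] for i in range(1, len(orders) + 1)}

    def fill(layer, start, length):
        if layer == 0:
            return
        order = orders[layer - 1]
        nodes = udd_nodes(order)
        # Odd order: one extra pulse closes the cycle, keeping the count even.
        n_pulses = order if order % 2 == 0 else order + 1
        for j in range(1, order + 2):
            left = start + length * nodes[j - 1]
            right = start + length * nodes[j]
            fill(layer - 1, left, right - left)  # one inner cycle per interval
            if j <= n_pulses:
                pulses[layer].append(right)

    fill(len(orders), 0.0, 1.0)
    return pulses
```

With this convention, layer $i$ receives $\bar N_i \prod_{k>i}(N_k+1)$ pulses, which makes the polynomial pulse count of the nested scheme easy to inspect for small examples.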



C. $(r_1, r_2, \dots, r_\ell)$-type error 

Each control operator $\Omega_i$ can divide the total Hamiltonian $H$ into two parts: one that commutes with $\Omega_i$ and another that anticommutes with $\Omega_i$. Hence, by the procedure provided in Appendix A, the total Hamiltonian can accordingly be divided into $2^\ell$ independent pieces by the given MOOS $\{\Omega_1, \Omega_2, \dots, \Omega_\ell\}$ as
\begin{equation}
H = \sum_{\{r_i=0,1\}_{i=1}^{\ell}} H_{(r_1,r_2,\dots,r_\ell)}, \tag{6}
\end{equation}
where
\begin{align}
r_i = 1 &\Rightarrow \{\Omega_i, H_{(r_1,r_2,\dots,r_\ell)}\} = 0, \notag\\
r_i = 0 &\Rightarrow [\Omega_i, H_{(r_1,r_2,\dots,r_\ell)}] = 0, \tag{7}
\end{align}
with $i = 1, 2, \dots, \ell$ and
\begin{equation}
\sum_{\{r_i=0,1\}_{i=1}^{\ell}} \equiv \sum_{r_1=0,1}\,\sum_{r_2=0,1}\cdots\sum_{r_\ell=0,1}. \tag{8}
\end{equation}
$H_{(r_1,r_2,\dots,r_\ell)}$ is classified as an $(r_1, r_2, \dots, r_\ell)$-type error, a definition which includes all the operators that have the same commuting or anticommuting relation as $H_{(r_1,r_2,\dots,r_\ell)}$ with respect to a given MOOS $\{\Omega_1, \Omega_2, \dots, \Omega_\ell\}$. In particular, the $\vec 0_\ell \equiv (0, 0, \dots, 0)$-type error, which commutes with all control pulses and hence is not suppressed by the NUDD sequence, is called a trivial error. All other error types are non-trivial. 

For example, suppose the first and second pulse types used in 2-layer NUDD (namely, QDD) are $Z$- and $X$-type pulses, respectively, and the target quantum system we consider is a single qubit subjected to general decoherence, which can always be modeled as
\begin{equation}
H = J_0 I\otimes B_0 + J_X\sigma_X\otimes B_X + J_Y\sigma_Y\otimes B_Y + J_Z\sigma_Z\otimes B_Z, \tag{9}
\end{equation}
where $B_\lambda$, $\lambda \in \{0, X, Y, Z\}$, are arbitrary bath operators, the Pauli matrices $\sigma_\lambda$, $\lambda \in \{X, Y, Z\}$, are the unwanted errors acting on the system qubit, and $J_\lambda$, $\lambda \in \{0, X, Y, Z\}$, are bounded qubit-bath coupling coefficients. Then the MOOS $\{\sigma_Z, \sigma_X\}$ divides the Hamiltonian $H$ into four pieces: one trivial error $H_{(0,0)} = J_0 I\otimes B_0$, and three non-trivial errors, $H_{(1,0)} = J_X\sigma_X\otimes B_X$, $H_{(0,1)} = J_Z\sigma_Z\otimes B_Z$, and $H_{(1,1)} = J_Y\sigma_Y\otimes B_Y$. 
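The classification in this example can be checked mechanically. The sketch below labels an operator by whether it commutes ($r_i = 0$) or anticommutes ($r_i = 1$) with each element of a MOOS, using plain nested lists as matrices. The helper names `matmul` and `error_type` are hypothetical and introduced only for illustration.

```python
def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def error_type(op, moos):
    """Return (r_1, ..., r_l): r_i = 0 if op commutes with moos[i], 1 if it anticommutes."""
    labels = []
    for omega in moos:
        forward, backward = matmul(omega, op), matmul(op, omega)
        if forward == backward:
            labels.append(0)
        elif forward == [[-x for x in row] for row in backward]:
            labels.append(1)
        else:
            raise ValueError("operator neither commutes nor anticommutes")
    return tuple(labels)


# Single-qubit Pauli matrices and the MOOS {sigma_Z, sigma_X} of the QDD example.
I2 = [[1, 0], [0, 1]]
X = [[0, 1], [1, 0]]
Y = [[0, -1j], [1j, 0]]
Z = [[1, 0], [0, -1]]
MOOS = [Z, X]
```

Calling `error_type(Y, MOOS)` then returns the label of the $\sigma_Y$ error with respect to the first-layer ($Z$) and second-layer ($X$) pulses.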



D. $(r_1, r_2, \dots, r_\ell)$-type error modulation function 

Due to the discreteness of the pulse timings, it is easier to perform the analysis in the toggling frame, i.e., the frame that rotates with the control pulses. Up to $\pm 1$ factors, the normalized ($T = 1$) control evolution operator is
\begin{equation}
U_c(\eta) = \Omega_\ell^{j_\ell-1}\cdots\Omega_2^{j_2-1}\,\Omega_1^{j_1-1} \tag{10}
\end{equation}
when $\eta \in [\eta_{j_\ell,j_{\ell-1},\dots,j_1-1}, \eta_{j_\ell,j_{\ell-1},\dots,j_1})$, and $U_c(1) = I$, the identity operator. 

Note that the commuting and anticommuting relation, Eq. (7), can be reformulated equivalently as follows,
\begin{equation}
\Omega_i H_{(r_1,r_2,\dots,r_\ell)}\,\Omega_i = (-1)^{r_i} H_{(r_1,r_2,\dots,r_\ell)} \tag{11}
\end{equation}
for $i$ from 1 to $\ell$, by using the unitary Hermitian property $\Omega_i^2 = I$. 



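Hence, with Eqs. (10) and (11), we obtain the Hamiltonian in the toggling frame,
\begin{align}
\tilde H(\eta) &= U_c^\dagger(\eta)\,H\,U_c(\eta) \notag\\
&= \sum_{\{r_i=0,1\}_{i=1}^{\ell}} f_1(\eta)^{r_1} f_2(\eta)^{r_2}\cdots f_\ell(\eta)^{r_\ell}\, H_{(r_1,r_2,\dots,r_\ell)}, \tag{12}
\end{align}
where
\begin{equation}
f_i(\eta) = (-1)^{j_i-1}, \qquad \eta \in [\eta_{j_\ell,\dots,j_i-1}, \eta_{j_\ell,\dots,j_i}) \tag{13}
\end{equation}
is the normalized $i$th-layer modulation function for an $\ell$-layer NUDD sequence, which switches sign only when the $i$th-layer UDD$_{N_i}$ pulse index, $j_i$, changes. 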

The factor $\prod_{i=1}^{\ell} f_i(\eta)^{r_i}$, i.e., the coefficient of the $(r_1, r_2, \dots, r_\ell)$-type error $H_{(r_1,r_2,\dots,r_\ell)}$ in Eq. (12), is called an $(r_1, r_2, \dots, r_\ell)$-type error modulation function. Note that since
\begin{equation}
f_i^{\,r_i^{(1)}} f_i^{\,r_i^{(2)}} = f_i^{\,r_i^{(1)}\oplus r_i^{(2)}}, \tag{14}
\end{equation}
where $\oplus$ is the binary addition defined as ordinary integer addition followed by the modulo 2 operation, $\{f_i^{\,r_i=0}, f_i^{\,r_i=1}\}$ forms a $Z_2$ group under ordinary multiplication. Likewise, for $(r_1, r_2, \dots, r_\ell)$-type error modulation functions, we have
\begin{equation}
\prod_{p=1}^{n}\prod_{i=1}^{\ell} f_i^{\,r_i^{(p)}} = \prod_{i=1}^{\ell} f_i^{\,\oplus_{p=1}^{n} r_i^{(p)}}, \tag{15}
\end{equation}
which indicates that the set of the $(r_1, r_2, \dots, r_\ell)$-type error modulation functions constitutes a group. 
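The closure property can be verified by brute force on a small sequence. The sketch below assumes that, on the innermost interval labelled by the indices $(j_1, \dots, j_\ell)$, each layer's modulation function takes the value $(-1)^{j_i-1}$, and checks that multiplying two error modulation functions gives the modulation function of the XOR-ed error type. All function names are hypothetical.

```python
from itertools import product


def modulation_value(error_type, indices):
    """Value of prod_i f_i^{r_i} on the interval (j_1, ..., j_l), with f_i = (-1)^(j_i - 1)."""
    sign = 1
    for r_i, j_i in zip(error_type, indices):
        if r_i and (j_i - 1) % 2:
            sign = -sign
    return sign


def xor_types(r, s):
    """Component-wise binary addition of two error types."""
    return tuple(a ^ b for a, b in zip(r, s))


def closed_under_multiplication(orders):
    """Check the group property on every innermost interval of an NUDD sequence."""
    intervals = list(product(*[range(1, n + 2) for n in orders]))
    types = list(product((0, 1), repeat=len(orders)))
    return all(
        modulation_value(r, j) * modulation_value(s, j)
        == modulation_value(xor_types(r, s), j)
        for r in types for s in types for j in intervals
    )
```

The check is independent of the actual interval lengths, since the modulation functions are piecewise constant and only their signs enter.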



E. Dyson expansion of the evolution operator in 
the toggling frame 



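With the shorthand notations
\begin{equation}
\vec r_\ell \equiv (r_1, r_2, \dots, r_\ell) \in \{0,1\}^{\otimes\ell} \tag{16}
\end{equation}
and
\begin{equation}
\sum_{\vec r_\ell} \equiv \sum_{\{r_i=0,1\}_{i=1}^{\ell}}, \tag{17}
\end{equation}
where the subscript $\ell$ of the vector $\vec r_\ell$ indicates that $\vec r_\ell$ has $\ell$ components, the Dyson series expansion of the evolution propagator in the toggling frame,
\begin{equation}
\tilde U(T) = \mathcal{T}\exp\left[-iT\int_0^1 \tilde H(\eta)\,d\eta\right], \tag{18}
\end{equation}
reads
\begin{equation}
\tilde U(T) = \sum_{n=0}^{\infty} (-iT)^n \sum_{\{\vec r_\ell^{\,(p)}\}_{p=1}^{n}} F_{\oplus_{p=1}^{n}\vec r_\ell^{\,(p)}}\; H_{\vec r_\ell}, \tag{19}
\end{equation}
where the operators in this expression are
\begin{equation}
H_{\vec r_\ell} \equiv \prod_{p=1}^{n} H_{\vec r_\ell^{\,(p)}} = H_{\vec r_\ell^{\,(n)}}\cdots H_{\vec r_\ell^{\,(2)}} H_{\vec r_\ell^{\,(1)}} \tag{20}
\end{equation}
(note the ordering), and the scalars are
\begin{equation}
F_{\oplus_{p=1}^{n}\vec r_\ell^{\,(p)}} \equiv \prod_{p=1}^{n}\int_0^{\eta^{(p+1)}}\prod_{j=1}^{\ell} f_j(\eta^{(p)})^{r_j^{(p)}}\,d\eta^{(p)} \tag{21}
\end{equation}
with $\eta^{(n+1)} = 1$, and where $r_j^{(p)}$ is the $j$th component of $\vec r_\ell^{\,(p)}$. Our task will be to find the conditions under which these $F_{\oplus_{p=1}^{n}\vec r_\ell^{\,(p)}}$ coefficients vanish, which will dictate the decoupling orders of the NUDD sequence. 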
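For $n = 1$ the coefficient is a single integral of a piecewise-constant modulation function, so it can be evaluated exactly as a signed sum of interval lengths. The sketch below assumes the standard normalized UDD nodes $\sin^2[j\pi/(2N+2)]$ and uses the fact that, in a nested sequence, the length of an innermost interval is the product of the UDD interval lengths of all layers, so the first-order coefficient factorizes over layers. The function names are hypothetical.

```python
import math


def udd_intervals(order):
    """Normalized UDD pulse intervals s_j, j = 1..N+1 (they sum to 1)."""
    nodes = [math.sin(j * math.pi / (2 * order + 2)) ** 2 for j in range(order + 2)]
    return [nodes[j] - nodes[j - 1] for j in range(1, order + 2)]


def first_order_coefficient(orders, error_type):
    """n = 1 coefficient: integral over [0, 1] of prod_i f_i(eta)^{r_i}.

    Each layer contributes sum_j (-1)^{r_i (j - 1)} s_j, where j labels the
    intervals of that layer (enumerate starts at j - 1 = 0).
    """
    value = 1.0
    for order, r_i in zip(orders, error_type):
        value *= sum((-1) ** (r_i * j) * s
                     for j, s in enumerate(udd_intervals(order)))
    return value
```

In this sketch the first-order coefficient vanishes (up to rounding) for every non-trivial error type, while for the trivial type it equals 1, consistent with the trivial error not being suppressed.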
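Since
\begin{align}
\Omega_i\left(\prod_{p=1}^{n} H_{\vec r_\ell^{\,(p)}}\right)\Omega_i &= \prod_{p=1}^{n}\Omega_i H_{\vec r_\ell^{\,(p)}}\Omega_i \notag\\
&= (-1)^{\oplus_{p=1}^{n} r_i^{(p)}}\prod_{p=1}^{n} H_{\vec r_\ell^{\,(p)}}, \tag{22}
\end{align}
where the first equality is obtained by inserting $\Omega_i^2 = I$ between any adjacent $H_{\vec r_\ell^{\,(p)}}$ and the second equality is obtained by using Eq. (11), $\prod_{p=1}^{n} H_{\vec r_\ell^{\,(p)}}$ either commutes ($\oplus_{p=1}^{n} r_i^{(p)} = 0$) or anticommutes ($\oplus_{p=1}^{n} r_i^{(p)} = 1$) with each control pulse operator $\Omega_i$. In other words, $\prod_{p=1}^{n} H_{\vec r_\ell^{\,(p)}}$ belongs to an $\vec r_\ell$-type error whose component $r_i = \oplus_{p=1}^{n} r_i^{(p)}$, and can be denoted as either $H_{\oplus_{p=1}^{n}\vec r_\ell^{\,(p)}}$ or $H_{\vec r_\ell}$ [as in Eq. (20)] if we only care about the resulting error type $\vec r_\ell$. Therefore, the set of $2^\ell$ $(r_1, r_2, \dots, r_\ell)$-type errors $H_{\vec r_\ell}$ form a group under multiplication. 

Note the key role played by
\begin{equation}
\vec r_\ell = \oplus_{p=1}^{n}\vec r_\ell^{\,(p)} = (r_1, r_2, \dots, r_\ell). \tag{23}
\end{equation}
There are $2^\ell$ such vectors, and they completely classify all the summands in the Dyson series (19). 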

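F. NUDD coefficients 

From Eq. (21), $F_{\oplus_{p=1}^{n}\vec r_\ell^{\,(p)}}$ is a normalized $n$-nested integral (total duration $T = 1$), and the $\vec r_\ell^{\,(p)}$ in its subscript indicates that the $p$th integrand of $F_{\oplus_{p=1}^{n}\vec r_\ell^{\,(p)}}$ is the $(r_1^{(p)}, r_2^{(p)}, \dots, r_\ell^{(p)})$-type error modulation function $\prod_{i=1}^{\ell} f_i(\eta^{(p)})^{r_i^{(p)}}$. According to $F_{\oplus_{p=1}^{n}\vec r_\ell^{\,(p)}}$'s associated error type $H_{\oplus_{p=1}^{n}\vec r_\ell^{\,(p)}}$ (or $H_{\vec r_\ell}$ for short), we can also denote $F_{\oplus_{p=1}^{n}\vec r_\ell^{\,(p)}}$ as $F^{(n)}_{\oplus_{p=1}^{n}\vec r_\ell^{\,(p)}}$ or $F^{(n)}_{\vec r_\ell}$ [Eq. (21)], and name it the $n$th-order normalized $\ell$-layer NUDD $\vec r_\ell$-type error coefficient. If no confusion can arise we will sometimes call $F^{(n)}_{\vec r_\ell}$ an "NUDD coefficient" for short. 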

Let us consider a couple of examples. From Eq. (21), the form of the 1-layer NUDD ($\vec r_1 = 1$)-type error coefficients
\begin{equation}
F_{\oplus_{p=1}^{n} r_1^{(p)}} = \prod_{p=1}^{n}\int_0^{\eta^{(p+1)}} f_1(\eta^{(p)})^{r_1^{(p)}}\,d\eta^{(p)} \tag{24}
\end{equation}
is exactly the same as the UDD coefficients appearing in [22] for the pure dephasing model, where the dephasing term $\sigma_z\otimes B_z$ is our ($\vec r_1 = 1$)-type error and the Pauli matrix $\sigma_x$ is our control pulse operator. 

Moreover, the form of the 2-layer NUDD $\vec r_2 = (r_1, r_2)$-type error coefficients
\begin{equation}
F_{\oplus_{p=1}^{n}\vec r_2^{\,(p)}} = \prod_{p=1}^{n}\int_0^{\eta^{(p+1)}} f_1(\eta^{(p)})^{r_1^{(p)}} f_2(\eta^{(p)})^{r_2^{(p)}}\,d\eta^{(p)} \tag{25}
\end{equation}
is exactly the same as the QDD coefficients defined in our earlier work [39], where $\sigma_x\otimes B_x$, $\sigma_z\otimes B_z$, and $\sigma_y\otimes B_y$ are our (1, 0)-, (0, 1)- and (1, 1)-type errors, respectively, while the Pauli matrices $\sigma_z$ and $\sigma_x$ are our first-layer and second-layer control pulses, respectively. 



Note that from Eq. (21), the NUDD coefficients are actually the same no matter what the control pulse operators are, as long as they are independent and constitute a MOOS. Therefore, the proofs for the performance of the UDD and QDD sequences in [22] and [39], which used single-qubit Pauli matrices as control pulses, apply directly to 1-layer NUDD and 2-layer NUDD schemes with more general control pulses. 



III. PERFORMANCE OF THE NUDD SEQUENCE 

A. The decoupling order of each error type 

As we observed, the summands in the Dyson series expansion of the evolution propagator Eq. (19) are each classified as one of $2^\ell$ types of errors. Therefore, for a given $\vec r_\ell$-type error, if all of its first $N_{\vec r_\ell}$ NUDD coefficients vanish, then we say that the $\ell$-layer NUDD sequence eliminates the $\vec r_\ell$-type error to order $N_{\vec r_\ell}$, i.e., $N_{\vec r_\ell}$ is the decoupling order of the $\vec r_\ell$-type error. 

Let us define
\begin{equation}
[x]_2 \equiv (x \bmod 2) \tag{26}
\end{equation}
and
\begin{align}
p_\oplus(a,b) &\equiv \oplus_{k=a}^{b}\, r_k [N_k]_2 \ \text{ if } b \ge a, \text{ otherwise } 0, \tag{27a}\\
p_+(a,b) &\equiv \sum_{k=a}^{b} r_k [N_k]_2 \ \text{ if } b \ge a, \text{ otherwise } 0. \tag{27b}
\end{align}
The value of $[N_k]_2$ indicates the parity of the sequence order $N_k$ of the $k$th UDD layer, i.e., $[N_k]_2 = 0$ or 1 when $N_k$ is even or odd, respectively. Hence $p_\oplus(1, i-1) \in \{0, 1\}$ gives the parity of the total number of UDD layers, each of which has odd sequence order and control pulses that anticommute with the $\vec r_\ell$-type error, in the first $i-1$ levels of NUDD. Likewise, $p_+(i+1, \ell)$ counts the total number of those UDD layers after the $i$th UDD layer, with odd sequence orders and control pulses anticommuting with the $\vec r_\ell$-type error. With this in mind, we shall prove the following theorem in Sec. IV. 

NUDD Theorem 1. An $\ell$-layer NUDD scheme with a given sequence order set $\{N_1, N_2, \dots, N_\ell\}$ eliminates $\vec r_\ell = (r_1, r_2, \dots, r_\ell)$-type errors to order $N_{\vec r_\ell}$, i.e.
\begin{equation}
F^{(n)}_{\vec r_\ell} = 0, \qquad \forall n \le N_{\vec r_\ell}, \tag{28}
\end{equation}
with the decoupling order of the $\vec r_\ell$-type error being
\begin{equation}
N_{\vec r_\ell} \equiv \max_{i\in\{1,\dots,\ell\}}\left[\, r_i\,\big(p_\oplus(1,i-1)\oplus 1\big)\,\tilde N_i + p_+(i+1,\ell)\,\right], \tag{29}
\end{equation}
where
\begin{equation}
\tilde N_i \equiv
\begin{cases}
N_i & \text{when } i \le o_1,\\
\min[N^{o_{\min}}_{<i} + 1,\, N_i] & \text{when } i > o_1,
\end{cases} \tag{30}
\end{equation}
and where the $o_1$th layer is the first UDD layer with odd sequence order, denoted as $N_{o_1}$, and
\begin{equation}
N^{o_{\min}}_{<i} \equiv \min_{k<i}\{N_k \mid [N_k]_2 = 1\}. \tag{31}
\end{equation}



Let us proceed to explain this theorem before embarking on its proof. First, note that $N^{o_{\min}}_{<i}$ is simply the minimum value among all the odd sequence orders of the first $i-1$ UDD layers of NUDD. As we shall see, $\tilde N_i$ is the suppression order of the $i$th UDD layer, in contrast with the sequence order $N_i$ of the same layer. By Eq. (30), $\tilde N_i \le N_i$. When $\tilde N_i < N_i$ occurs, we have $\tilde N_i = N^{o_{\min}}_{<i} + 1$, which suggests that the suppression order of the $i$th UDD layer is partially hindered by the UDD layer with the smallest odd sequence order, which is nested inside the $i$th UDD layer. 

The coefficient in front of the suppression order $\tilde N_i$ in Eq. (29) is 1 if and only if both $r_i = 1$ and $p_\oplus(1, i-1) = 0$ are satisfied, and vanishes otherwise. Accordingly, there are two requirements for the $i$th UDD layer to be effective on a given $\vec r_\ell$-type error. First, the error must anticommute with the control pulses of this UDD layer ($r_i = 1$). Second, the error must anticommute with a total even number of odd order UDD layers among the first $(i-1)$ layers [$p_\oplus(1, i-1) = 0$]. 

In contrast, if an error anticommutes with a total odd number of odd order UDD layers among the first $(i-1)$ layers [$p_\oplus(1, i-1) = 1$], then the $i$th-layer UDD sequence is totally ineffective in suppressing this error type, even though this error also anticommutes with the control pulses of the $i$th UDD layer. 

Note that for the trivial error type, the $\vec 0_\ell$-type error, we have $N_{\vec 0_\ell} = 0$, which by Eq. (28) implies $F^{(n)}_{\vec 0_\ell} = 0$ for $n \le 0$. This should be interpreted as saying that the vanishing of the $\vec 0_\ell$-type errors cannot be concluded from Theorem 1. 
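The decoupling order formula is simple enough to evaluate by hand for a few layers, but a short script helps when exploring arrangements of sequence orders. The sketch below assumes the formula reads $N_{\vec r_\ell} = \max_i [r_i (p_\oplus(1,i-1) \oplus 1)\tilde N_i + p_+(i+1,\ell)]$, with $\tilde N_i$ equal to $N_i$ unless an odd-order layer lies inside layer $i$, in which case it is capped at the smallest such odd order plus one. Layers are listed innermost first, and all function names are hypothetical.

```python
from itertools import product


def suppression_orders(orders):
    """Suppression order of each layer: N_i, capped by (smallest inner odd order) + 1."""
    result, min_odd = [], None
    for n in orders:
        result.append(n if min_odd is None else min(min_odd + 1, n))
        if n % 2 == 1:
            min_odd = n if min_odd is None else min(min_odd, n)
    return result


def decoupling_order(orders, error_type):
    """Decoupling order of an error type (r_1, ..., r_l) for given sequence orders."""
    tilde = suppression_orders(orders)
    # r_k * [N_k]_2: layer k has odd order and anticommutes with the error.
    odd_hits = [r * (n % 2) for r, n in zip(error_type, orders)]
    best = 0
    for i, r_i in enumerate(error_type):
        p_xor = sum(odd_hits[:i]) % 2        # parity over the inner layers
        p_plus = sum(odd_hits[i + 1:])       # count over the outer layers
        best = max(best, r_i * (p_xor ^ 1) * tilde[i] + p_plus)
    return best


def overall_order(orders):
    """Minimum decoupling order over all non-trivial error types."""
    types = [r for r in product((0, 1), repeat=len(orders)) if any(r)]
    return min(decoupling_order(orders, r) for r in types)
```

For instance, `decoupling_order((3, 4), (1, 1))` evaluates the formula for the error type that anticommutes with both layers of a 2-layer scheme with an odd inner and an even outer sequence order.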

In the following, we discuss the decoupling order for- 
mula Eq. (29) for some particular examples of NUDD 
schemes. 



1. 1-layer NUDD (UDD) 

Since 1-layer NUDD has $\ell = 1$, for the decoupling order of the ($\vec r_1 = 1$)-type error, Eq. (29) gives $N_{\vec r_1 = 1} = \tilde N_1$. By the definition of $\tilde N_i$ [Eq. (30)], the suppression order of the first UDD layer $\tilde N_1$ is always equal to its sequence order $N_1$, i.e. $\tilde N_1 = N_1$, irrespective of $N_1$'s parity. Therefore, we have
\begin{equation}
N_{\vec r_1 = 1} = N_1, \tag{32}
\end{equation}
which shows that the non-trivial error is eliminated up to the UDD's sequence order, in agreement with [21, 22]. 

2. 2-layer NUDD (QDD) 

For 2-layer NUDD with a sequence order set $\{N_1, N_2\}$, the decoupling order formula Eq. (29) simplifies to
\begin{equation}
N_{\vec r_2} = \max\big[\, r_1 N_1 + r_2 [N_2]_2,\; r_2\,(r_1 [N_1]_2 \oplus 1)\,\tilde N_2 \,\big]. \tag{33}
\end{equation}
Specifically, we have
\begin{align}
N_{(1,0)} &= \tilde N_1 = N_1, \notag\\
N_{(0,1)} &= \max\big[\,[N_2]_2,\; (0 \oplus 1)\tilde N_2\,\big] = \tilde N_2, \notag\\
N_{(1,1)} &= \max\big[\, N_1 + [N_2]_2,\; ([N_1]_2 \oplus 1)\tilde N_2 \,\big]. \tag{34}
\end{align}
If we identify the (1, 0)-type, (0, 1)-type, and (1, 1)-type errors to be $\sigma_x$, $\sigma_z$, and $\sigma_y$ errors, respectively, Eq. (34) agrees exactly with the results obtained in our earlier QDD work [39]. 

3. NUDD with all even sequence orders 

For $\ell$-layer NUDD with $[N_i]_2 = 0$ $\forall i \le \ell$, the suppression order of each UDD layer is equal to its corresponding sequence order $N_i$, $\tilde N_i = N_i$ [Eq. (30)]. Therefore, the decoupling order formula Eq. (29) for this type of NUDD scheme reduces to
\begin{equation}
N_{\vec r_\ell} = \max_{i\in\{1,\dots,\ell\}} [r_i \tilde N_i] = \max_{i\in\{1,\dots,\ell\}} [r_i N_i], \tag{35}
\end{equation}
which indicates that the suppression ability of the UDD sequence in each layer is unaffected by the other nested UDD sequences, i.e., successive UDD layers do not interfere with one another. 

4. NUDD with all even sequence orders except the outer-most UDD layer 

For $\ell$-layer NUDD with $[N_i]_2 = 0$ $\forall i < \ell$, and $[N_\ell]_2 = 1$, we again have $\tilde N_i = N_i$ [Eq. (30)]. Therefore, the decoupling order formula Eq. (29) reads
\begin{equation}
N_{\vec r_\ell} = \max\big[\, \{ r_i N_i + r_\ell \}_{i=1}^{\ell-1},\; r_\ell N_\ell \,\big]. \tag{36}
\end{equation}
From Eq. (36), we can see that if a given error anticommutes with the outer-most UDD layer, namely, $r_\ell = 1$, then the outer-most layer boosts the suppression abilities of all inner UDD layers by one additional order. We call this the outer-odd-UDD effect. 

5. NUDD with all odd sequence orders 

For any NUDD scheme with all odd sequence orders, in general, $\tilde N_i = N^{o_{\min}}_{<i} + 1 < N_i$ could occur, which suggests that the suppression ability of the $i$th UDD layer is hindered by one of the odd order UDD layers inside the $i$th layer. 

Nevertheless, for the case of $\ell$-layer NUDD with $[N_i]_2 = 1$ $\forall i$ and $N_1 \ge N_2 \ge \cdots \ge N_\ell$, $\tilde N_i = N_i$ [Eq. (30)] is guaranteed. Moreover, it is easy to show that
\begin{equation}
N_i + \sum_{k=i+1}^{\ell} r_k \;\ge\; N_j + \sum_{k=j+1}^{\ell} r_k \tag{37}
\end{equation}
for any $i < j$. Accordingly, due to Eq. (37), it turns out that the maximum value in Eq. (29) occurs at the inner-most UDD layer that the error anticommutes with. Suppose the first non-zero component of an $\vec r_\ell$-type error is $r_M = 1$. Then it follows that the $M$th UDD layer has the maximum suppression on this error, i.e.
\begin{equation}
N_{\vec r_\ell} = N_M + \sum_{k=M+1}^{\ell} r_k. \tag{38}
\end{equation}
The second term of the above equation implies that the outer-odd-UDD suppression effects generated by UDD layers with odd sequence orders outside the $M$th layer all add up. In other words, the outer-odd-UDD suppression effect is cumulative. 

B. The overall performance of NUDD scheme 



The overall performance of an $\ell$-layer NUDD scheme is quantified by the minimum over the decoupling orders of all non-trivial error types, i.e., by
\begin{equation}
N_{\min} \equiv \min_{\{\vec r_\ell \ne \vec 0_\ell\}} [N_{\vec r_\ell}]. \tag{39}
\end{equation}
We call $N_{\min}$ the overall decoupling order. The following corollary of the NUDD Theorem states the relationship between the overall decoupling order $N_{\min}$ and the given sequence orders $\{N_i\}$: 

Corollary 1. The overall decoupling order $N_{\min}$ of an $\ell$-layer NUDD scheme with a given sequence order set $\{N_1, N_2, \dots, N_\ell\}$ is
\begin{equation}
N_{\min} = \min_{i\in\{1,\dots,\ell\}} [N_i]. \tag{40}
\end{equation}

Proof of Corollary 1. For a given error type $\vec r_\ell \ne \vec 0_\ell$, suppose the inner-most non-zero component is $r_i = 1$ with $i \ge 2$, i.e., $r_{k<i} = 0$, so that $p_\oplus(1, i-1) = 0$. By definition, for $i = 1$ we also have $p_\oplus(1, i-1) = 0$. Due to $r_i(p_\oplus(1, i-1) \oplus 1) = 1$ and $N_{\vec r_\ell}$ taking the maximum value over the set displayed in Eq. (29), it follows that
\begin{equation}
N_{\vec r_\ell} \ge r_i\big(p_\oplus(1, i-1) \oplus 1\big)\tilde N_i = \tilde N_i. \tag{41}
\end{equation}
Among the $\vec r_\ell \ne \vec 0_\ell$ error types, there are error types which anticommute with only one control pulse type, denoted as $\vec e_i$, with $r_i = 1$ and all $r_{k \ne i} = 0$. According to Eq. (29), the decoupling order of the $\vec e_i$-type error is
\begin{equation}
N_{\vec e_i} = \tilde N_i. \tag{42}
\end{equation}
Owing to Eqs. (41) and (42), $N_{\min}$, the minimum among the decoupling orders of all non-trivial error types [Eq. (39)], occurs among the decoupling orders of all $\vec e_i$-type errors, i.e.,
\begin{equation}
N_{\min} = \min_{i\in\{1,\dots,\ell\}} [N_{\vec e_i}] = \min_{i\in\{1,\dots,\ell\}} [\tilde N_i]. \tag{43}
\end{equation}
Suppose that among the $\ell$ layers, the $o_1$th, $o_2$th, $\dots$, and $o_b$th UDD layers, where $0 < o_1 < o_2 < \cdots < o_b \le \ell$, are the layers with odd sequence orders. By the definition of $\tilde N_i$ [Eq. (30)], for $i \le o_1$, $\tilde N_i = N_i$, while for $o_{j-1} < k \le o_j$ with $j \in \{2, \dots, b\}$, $\tilde N_k = \min[N_{o_1} + 1, N_{o_2} + 1, \dots, N_{o_{j-1}} + 1, N_k]$. Then it follows that
\begin{equation}
\min_{i\in\{1,\dots,\ell\}} [\tilde N_i] = \min_{i\in\{1,\dots,\ell\}} [N_i], \tag{44}
\end{equation}
which proves Eq. (40). □ 

IV. PROOF OF THE NUDD THEOREM 

A. Synopsis of the proof 

Our proof is by induction. To establish the base case, we recall that (as already discussed in Sec. II F) the proofs of the vanishing of UDD coefficients in [22] and QDD coefficients in our earlier work [39], which used single-qubit Pauli matrices as control pulses, are also valid for the 1-layer and the 2-layer NUDD schemes with a more general set of control pulses (MOOS). From [22] and [39], the decoupling orders of each error type for UDD and QDD match exactly with Eq. (29). Accordingly, the NUDD Theorem holds for the 1-layer and the 2-layer NUDD sequence. 

Suppose that Theorem 1 holds for $(\ell-1)$-layer NUDD with an arbitrary integer $\ell > 2$, i.e.
\begin{equation}
F^{(n)}_{\vec r_{\ell-1}} = 0, \qquad \forall n \le N_{\vec r_{\ell-1}}. \tag{45}
\end{equation}
Then the remaining task is to show that Theorem 1, in particular Eqs. (28) and (29), also holds for $\ell$-layer NUDD. The procedure is the following. 

$F^{(n)}_{\vec r_\ell}$ can be re-expressed in two different forms, building on two different methods adapted from our earlier QDD proof [39]: the outer-most-layer interval decomposition ("Method 1"), and the nested integral analysis with certain function types ("Method 2"). Each method allows us to show the vanishing of $F^{(n)}_{\vec r_\ell}$ for some error types, and when put together the two methods complete the proof. 

In Sec. IV B, using Method 1, the $\ell$-layer NUDD coefficient $F^{(n)}_{\vec r_\ell}$ is expressed in terms of the $(\ell-1)$-layer NUDD coefficients. Then, in Sec. IV C we show that it is the vanishing of the $(\ell-1)$-layer NUDD coefficients that makes the first $N_{\vec r_{\ell-1}}$ orders of the $\ell$-layer NUDD coefficient $F^{(n)}_{\vec r_\ell}$ vanish, i.e.,
\begin{equation}
F^{(n)}_{\vec r_\ell = (\vec r_{\ell-1}, r_\ell)} = 0, \qquad \forall n \le N_{\vec r_{\ell-1}}, \tag{46}
\end{equation}
due to the inductive assumption Eq. (45). 

Further, using the outer-layer interval decomposition form of $F^{(n)}_{\vec r_\ell}$, we show in Sec. IV D that for the $(\vec r_{\ell-1}, 1)$-type error, which anticommutes with the control pulses of the $\ell$th UDD layer, if the $\ell$th sequence order is odd then, due to the anti-symmetry of the $\ell$th UDD layer, the first $\ell-1$ UDD layers in fact suppress the error by one more order, namely,
\begin{equation}
F^{(N_{\vec r_{\ell-1}}+1)}_{(\vec r_{\ell-1}, 1)} = 0. \tag{47}
\end{equation}
Combining Eq. (47) with Eq. (46), Method 1 gives rise to
\begin{equation}
F^{(n)}_{\vec r_\ell} = 0, \qquad \forall n \le N_{\vec r_{\ell-1}} + r_\ell [N_\ell]_2. \tag{48}
\end{equation}



Let us now define two special cases, by parity: 

vf = ft with = a, ae{0, 1} (49) 

The decoupling order of the (■y f °_ 1 , l)-type error which 



by Eq. (49) includes the (Oe-i, l)-type error, is derived 
independently by Method 2 in subsections |IV E|IV G| 

The second part of the NUDD proof is summarized 
as follows. In Sec. IV E we apply a piecewise linear 



change of variables to , such that the Fourier expan- 
sions of all its integrands, ]X=i fiivY'i belong to cer- 
tain specific function types. In particular, for NUDD 
with all even inner sequence orders, there are only two 
function types, {c^ e >°, C^d°}, while for NUDD with at 
least one odd sequence order, there are four types, 

{c^°,C^d ,Co™n:C^d°}- These function types are de- 
fined as follows: 



Definition 1. Let k,q € Z with \q\ < n and let dki G 



be arbitrary, 
name 

Ne,n 
C odd 

„N t ,n 



/-N t ,n 
seven 



B. The outer-layer interval decomposition form of 



Sodd 



function type 

d kl cos[(2fc + 1)(A^ + 1)6* + q6] 

k 

J2 d k2 C0s[2k(N e + 1)9 + q9] 
k 

dm sin[2fc(iV, + 1)9 + q9] 

k 

J2 dki sin[(2fc + l){N t + 1)9 + q&\ 



Also, leto denote the product-to-sum trigonometric func- 
tion operation. 



rt 



(n) 



It is expected that the (ft—i ^ Ot—i> r£)-type errors, 
which anticommute with one or more of the first (£ — 
1) inner-layer control operators, are suppressed mainly 
by the inner (£ — l)-layer NUDD sequences. In order 
to extract the action of the inner (£ — l)-layer NUDD 
scheme and factor out the action of the outer-most (£ th - 
layer) UDD sequence, we employ the outer-layer interval 



decomposition method, which splits each integral of 



(n) 



[Eq. (21 1] into a sum of sub-integrals over the £ th -layer 
UDDjv f pulse intervals Sj e [Eq. dEJ]. 



By utilizing the group properties of {c^„,C^dd°} ( or 
{c^CfX^°,C£}), after integrating out the first 

(n) 

n — 1 integrals of , we obtain the second expression 
forFi n) , 



„N e ,n 
"odd 



d6 



= 2_^d k sin[(2fc + l)(N e + 1)9 + q9% 



= 0. 



(50) 



for the (vg_i, l)-type error, where n < N& for NUDD with 

all even inner sequence orders, and n < mm[N^Jr^ + 
1,N(] for NUDD with at least one odd inner sequence 
order. The vanishing orders of the other types of er- 
rors cannot be deduced from this method and we denote 

(n) 

this fact by: Fz L-. a „ = 0, Vn < 0. In conclusion, 
Method 2 yields 

F^ = 0, Vn<n[(p e (l,£-l)®l]N e . (51) 



The derivation of given in Appendix |b| shows that 
each ^-layer NUDD coefficient can be expressed in terms 
of the (£ — l)-layer NUDD coefficients as follows, 



re=(fe-i,re) 



E 



o=l 



(iKtfxn e (- 1 ) 

' £-1 



•Ca+1) 

m j£ 



Ui a) -l)r< a > 



a=1 3 y=a 



(53) 



where j^ n+1 ' — 1 = Nt + 1, and for given m, 
^-'{J2 m " =«} sums over au possible combinations of 
{ n aYa=i under the condition J2T=i n a = n i with 1 < 
n a < n. From the derivation of Eq. ( 53 ) in Appendix 



|B} one can see that each segment with a specific config- 
uration {m, {fiajaLi} defines a unique way to separate 
the n vectors {r/ p ^}p =1 into m clusters. The a th cluster 
contains n a vectors, {rj^y^T 1 nk+n " 



P=J2l = i n k +l 



with n = 0, and 



is the resulting vector of the a cluster, i.e., 



Combining Eq. (48) with Eq. (51), we have 



(54) 



4;) = o 



(52) which implies 



Vn < max[JV rVl + rt[N e } 2 ,rt((p B (l,£ - 1) ffi l)Nt 



Equation (52) is equivalent to the decoupling order formula Eq. (29) given in Theorem 1, as can be seen by substituting the explicit expression for $\tilde N_{\vec r_{\ell-1}}$, i.e., Eq. (29) for $(\ell-1)$-layer NUDD, into Eq. (52). Therefore, Theorem 1 also holds for the $\ell$-layer NUDD scheme with arbitrary sequence orders. The induction method also implies that Theorem 1 holds for any number of nested layers of NUDD.



ri, i.e 



7 a =l' £-1 

;>m (a) 



rt-i 
re- 



(55a) 
(55b) 



Moreover, the a cluster is associated with F 

order (£— l)-layer -type error NUDD coefficient, 



(«o) 

I— 1 



and with £ j {>_ 



which contains 



the outermost (£ ) UDD layer's information. 
C. The performance of the inner $(\ell-1)$-layer NUDD



D. One more order suppression of the $(\vec r_{\ell-1}, 1)$-type error



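We shall show that it is the vanishing of the inner $(\ell-1)$-layer NUDD coefficients, $F^{(n_a)}_{\vec r^{(a)}_{\ell-1}} = 0$, $\forall n_a \le \tilde N_{\vec r^{(a)}_{\ell-1}}$, that makes the first $\tilde N_{\vec r_{\ell-1}}$ orders of $F^{(n)}_{\vec r_\ell}$ vanish.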

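First, we have the following lemma, with the proof given in Appendix C.

Lemma 1. $\tilde N_{\vec r_{\ell-1}} \le \sum_{a=1}^{m} \tilde N_{\vec r^{(a)}_{\ell-1}}$, where $\vec r_{\ell-1} = \oplus_{a=1}^{m} \vec r^{(a)}_{\ell-1}$.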



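In the previous section, we proved that $F^{(n)}_{(\vec r_{\ell-1}, r_\ell)} = 0$ for all $n \le \tilde N_{\vec r_{\ell-1}}$ [Eq. (46)]. Now we shall show that for the $(\vec r_{\ell-1}, 1)$-type error, there is an additional order of suppression, i.e., $F^{(\tilde N_{\vec r_{\ell-1}}+1)}_{(\vec r_{\ell-1},1)} = 0$ [Eq. (47)], if $N_\ell$ is odd.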



First, we need the following lemma:

Lemma 2. For $\sum_{a=1}^{m} n_a = n = \tilde N_{\vec r_{\ell-1}} + 1$ with $m \ge 2$, there must exist at least one $a$ such that $n_a \le \tilde N_{\vec r^{(a)}_{\ell-1}}$.

Proof of Lemma 2. With Lemma 1 we have



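With Lemma 1, for $F^{(n)}_{\vec r_\ell}$ with $n \le \tilde N_{\vec r_{\ell-1}}$, each $\prod_{a=1}^{m} F^{(n_a)}_{\vec r^{(a)}_{\ell-1}}$ with $\oplus_{a=1}^{m} \vec r^{(a)}_{\ell-1} = \vec r_{\ell-1}$ in Eq. (53) satisfies

$\sum_{a=1}^{m} n_a = n \le \tilde N_{\vec r_{\ell-1}} \le \sum_{a=1}^{m} \tilde N_{\vec r^{(a)}_{\ell-1}}.$ (56)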



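From Eq. (56), it follows that there must exist at least one $a$ from 1 to $m$ such that $n_a \le \tilde N_{\vec r^{(a)}_{\ell-1}}$, because if $n_a > \tilde N_{\vec r^{(a)}_{\ell-1}}$ for all $a$, it leads to

$\sum_{a=1}^{m} n_a > \sum_{a=1}^{m} \tilde N_{\vec r^{(a)}_{\ell-1}},$ (57)

which contradicts the assumption Eq. (56). Accordingly, each $\prod_{a=1}^{m} F^{(n_a)}_{\vec r^{(a)}_{\ell-1}}$ term of $F^{(n)}_{\vec r_\ell}$ with $n \le \tilde N_{\vec r_{\ell-1}}$ contains at least one $F^{(n_a)}_{\vec r^{(a)}_{\ell-1}}$ with $n_a \le \tilde N_{\vec r^{(a)}_{\ell-1}}$, which turns out to be zero due to the assumption Eq. (45) for the $(\ell-1)$-layer NUDD coefficients. Therefore, the first step, $F^{(n)}_{\vec r_\ell = (\vec r_{\ell-1}, r_\ell)} = 0$ for all $n \le \tilde N_{\vec r_{\ell-1}}$, Eq. (46), is proven.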

From the proof, one can see that the effect of the 
outer f-layer UDD sequence which is entirely contained 

in the outer part 7]™=! eC^'V 1 ) *" 5-1 ^ «!& in 

Jfi — a J B 



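$\sum_{a=1}^{m} n_a \le \sum_{a=1}^{m} \tilde N_{\vec r^{(a)}_{\ell-1}} + 1.$ (58)

For $m \ge 2$, pick an arbitrary $n_{a'}$ among $\{n_a\}_{a=1}^{m}$. If $n_{a'} \le \tilde N_{\vec r^{(a')}_{\ell-1}}$, then Lemma 2 is true. On the other hand, if $n_{a'} \ge \tilde N_{\vec r^{(a')}_{\ell-1}} + 1$, subtracting $n_{a'}$ from Eq. (58) leads to

$\sum_{a \neq a'} n_a \le \sum_{a=1}^{m} \tilde N_{\vec r^{(a)}_{\ell-1}} + 1 - n_{a'} \le \sum_{a \neq a'} \tilde N_{\vec r^{(a)}_{\ell-1}}.$ (59)

Accordingly, by the same counterargument mentioned in Sec. IV C, there must exist at least one $a'' \neq a'$ such that $n_{a''} \le \tilde N_{\vec r^{(a'')}_{\ell-1}}$, for otherwise we would obtain $\sum_{a \neq a'} n_a > \sum_{a \neq a'} \tilde N_{\vec r^{(a)}_{\ell-1}}$, which contradicts Eq. (59). □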
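The counting argument behind Lemma 2 can be checked by brute force. The following is a minimal sketch, not part of the proof: the helper names `compositions` and `lemma2_holds` are hypothetical, the tuple `bounds` stands in for the inner decoupling orders $\tilde N_{\vec r^{(a)}_{\ell-1}}$, and the worst case $n = \sum_a \tilde N_{\vec r^{(a)}_{\ell-1}} + 1$ allowed by Eq. (58) is assumed.

```python
def compositions(n, m):
    """Yield every way to write n as an ordered sum of m positive integers."""
    if m == 1:
        if n >= 1:
            yield (n,)
        return
    # The first part can be at most n - (m - 1), leaving >= 1 for each other part.
    for first in range(1, n - m + 2):
        for rest in compositions(n - first, m - 1):
            yield (first,) + rest


def lemma2_holds(bounds):
    """Check that every composition of n = sum(bounds) + 1 into len(bounds)
    parts contains at least one part n_a with n_a <= bounds[a]."""
    m = len(bounds)
    n = sum(bounds) + 1
    return all(
        any(n_a <= b for n_a, b in zip(parts, bounds))
        for parts in compositions(n, m)
    )
```

For a single cluster ($m = 1$) the check fails by construction, which mirrors the fact that only the $m = 1$ term survives in the outer-layer decomposition.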

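With Lemma 2, it follows that for the outer-layer decomposition form of $F^{(\tilde N_{\vec r_{\ell-1}}+1)}_{\vec r_\ell}$, Eq. (53), in each $\prod_{a=1}^{m} F^{(n_a)}_{\vec r^{(a)}_{\ell-1}}$ with $m \ge 2$ and $\sum_{a=1}^{m} n_a = n = \tilde N_{\vec r_{\ell-1}} + 1$, there must exist at least one $F^{(n_a)}_{\vec r^{(a)}_{\ell-1}}$ with $n_a \le \tilde N_{\vec r^{(a)}_{\ell-1}}$. Due to the induction assumption Eq. (45), all $\prod_{a=1}^{m} F^{(n_a)}_{\vec r^{(a)}_{\ell-1}}$ with $m \ge 2$ in Eq. (53) vanish. Therefore, only the $\{m = 1, n_a = n\}$ term, $F^{(\tilde N_{\vec r_{\ell-1}}+1)}_{\vec r_{\ell-1}}$, remains in the outer-layer decomposition form of $F^{(\tilde N_{\vec r_{\ell-1}}+1)}_{\vec r_\ell}$.

Eq. (53) does not interfere with the elimination ability of the inner $(\ell-1)$-layer NUDD sequence.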

Note that for the Ft, — (0^-i, l)-type error, N$ i = 



In particular, 



for the (Ft-i, l)-type error, we have 



by Eq. (|29| gives rise to F& = for all n < N 5( 



[Eq. (46)]. However, this does not mean that $F^{(n)}_{(\vec 0_{\ell-1},1)} \neq 0$ in our context, but rather that the vanishing of $F^{(n)}_{(\vec 0_{\ell-1},1)}$ cannot be deduced by the just-mentioned method, which only considers the contribution of the first $(\ell-1)$ UDD layers. This makes sense because the $\vec r_\ell = (\vec 0_{\ell-1}, 1)$-type error, which commutes with all the control pulses of the inner $(\ell-1)$ UDD layers, is supposed to be suppressed by the $\ell$th-layer UDD sequence.



JN?., +1) 



Ni + 1 



Em: 



(60) 



Despite $F^{(\tilde N_{\vec r_{\ell-1}}+1)}_{\vec r_{\ell-1}} \neq 0$, the outer sum over $j_\ell$ in Eq. (60) turns out to be zero when $N_\ell$ is odd, because the terms $j_\ell$ and $N_\ell + 2 - j_\ell$ carry opposite signs but are otherwise equal, $s_{j_\ell} = s_{N_\ell+2-j_\ell}$ (the time-symmetric property of UDD). Therefore, $F^{(\tilde N_{\vec r_{\ell-1}}+1)}_{(\vec r_{\ell-1},1)} = 0$ and Eq. (47) is proven. Combining Eq. (47) with Eq. (46), which we proved in the previous section, leads to $F^{(n)}_{\vec r_\ell} = 0$ for $n \le \tilde N_{\vec r_{\ell-1}} + r_\ell [N_\ell]_2$, i.e., Eq. (48).

From the proof, one can see that the anti-symmetry of the $\ell$th outer-most UDD layer with odd sequence order can help the inner $(\ell-1)$-layer NUDD sequences to suppress the $(\vec r_{\ell-1}, 1)$-type error by one more order.
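The sign cancellation used above can be illustrated numerically. The sketch below assumes the standard UDD timing $t_j = T \sin^2[j\pi/(2N+2)]$ with $T = 1$ and, as a further assumption, an outer sum of the form $\sum_{j}(-1)^{j-1} s_j^{\,p}$ for an arbitrary power $p$; the function names are hypothetical. Because the argument relies only on $s_j = s_{N+2-j}$ and on the opposite signs of the paired terms, the sum vanishes for odd $N$ whatever the power.

```python
import math


def udd_intervals(N):
    """Normalized UDD_N pulse intervals s_j, j = 1..N+1, for total time T = 1."""
    t = [math.sin(j * math.pi / (2 * N + 2)) ** 2 for j in range(N + 2)]
    return [t[j] - t[j - 1] for j in range(1, N + 2)]


def alternating_power_sum(N, p):
    """Sum over j of (-1)^(j-1) * s_j^p for the UDD_N intervals."""
    s = udd_intervals(N)
    return sum((-1) ** (j - 1) * s[j - 1] ** p for j in range(1, N + 2))
```

For even $N$ the paired terms carry equal signs, so no such cancellation is available beyond the first power.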



E. Fourier expansion after linear change of 
variables 



We shall complete the proof of Theorem 1 by another approach, which analyzes the nested integral in terms of certain Fourier function types. First, apply a piecewise linear transformation,



N e + 1 



V-Vh-i 



N e + 1 



V e [rjj t 



(61) 



with ji € {1, . . . , Nt}, to the n-fold nested integral Fi f 
With drj = Gi(6) d0, F^=F & 
pressed as 



(n) 



„ - w Eq. (21| is reexp- 
p=i' t 1 — 1 



pin) 



n 

P =i 



Gi(6»W) J[fi(6^) r i P) de^ (62) 



with 

g|(n+l) 
fi(0) 

Gi(8) 

where 9 is the new pulse timing. 

The advantage of using the piecewise linear transfor- 
mation Eq. (61) is that all the modulation functions be- 



71", 

(-1)*" 
N, + l 



*3l 



e [0*,.. 
e [9 h . 



(63) 
(64) 

(65) 



come periodic functions. Consequently, as explained in 
Appendix [Dj it turns out that the Fourier expansion of 
each function appearing in F -< P ) belongs to one of 

the function types from Definition [T] 



*(Gi(0)) = C, 



/-N e ,0 
Sodd ' 



r Ni,0 
°evcn 

Seven 



when Ni even 
when Ni odd 



■even J 



(66) 
(67) 
(68) 



where $\Psi$ maps a function to the function type of its Fourier expansion up to unimportant coefficients.

Note that the sets $\{c^{N_\ell,0}_{\rm even}, s^{N_\ell,0}_{\rm odd}\}$ and $\{c^{N_\ell,0}_{\rm even}, s^{N_\ell,0}_{\rm odd}, s^{N_\ell,0}_{\rm even}, c^{N_\ell,0}_{\rm odd}\}$ constitute $Z_2$ and $Z_2 \times Z_2$ groups with $c^{N_\ell,0}_{\rm even}$ as identity, respectively, under the binary operation $\circ$ (the product-to-sum trigonometric formula). Then one can obtain the function types of the $\vec r_\ell$-type error modulation functions by employing the group algebra of $Z_2$ and $Z_2 \times Z_2$ to

$\Psi\big(\prod_{i=1}^{\ell} f_i(\theta)^{r_i}\big) = \bigcirc_{i=1}^{\ell}\, \Psi\big(f_i(\theta)^{r_i}\big),$ (69)



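where $\Psi(f_i(\theta)^{r_i=0}) = \Psi(1) = c^{N_\ell,0}_{\rm even}$ for all $i$. For example, for the $\vec v^1_{\ell-1}$-type error [see Eq. (49)], its $\bigcirc_{i=1}^{\ell-1} \Psi(f_i(\theta)^{r_i})$ would result in $s^{N_\ell,0}_{\rm even}$ because the condition $\oplus_{k=1}^{\ell-1} r_k [N_k]_2 = 1$, which the components of $\vec v^1_{\ell-1}$ satisfy, implies that there is a total odd number of $r_i = 1$ associated with $[N_i]_2 = 1$ such that $\Psi(f_i(\theta)^{r_i}) = s^{N_\ell,0}_{\rm even}$. It turns out that $\Psi(\prod_{i=1}^{\ell} f_i(\theta)^{r_i})$ is determined by only two values, $\oplus_{k=1}^{\ell-1} r_k [N_k]_2$ and $r_\ell$ (see Table I).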
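The group bookkeeping described above can be sketched in a few lines. This is an illustration under stated assumptions rather than the construction of Appendix D: a function type is encoded as a hypothetical pair (kind, parity); the product-to-sum rules give cosine for like kinds and sine for unlike kinds, while the parities of the multiples of $(N_\ell+1)\theta$ add modulo 2; and the types of the individual modulation functions ($s_{\rm odd}$ for the outermost layer, $c_{\rm even}$ or $s_{\rm even}$ for an inner layer of even or odd order) are taken as read from Eqs. (66)-(68).

```python
C_EVEN, S_ODD = ("c", "even"), ("s", "odd")
S_EVEN, C_ODD = ("s", "even"), ("c", "odd")


def compose(a, b):
    """Product-to-sum rule on function types: cos*cos and sin*sin give cos,
    cos*sin gives sin; even/odd multiples combine like parities under addition."""
    kind = "c" if a[0] == b[0] else "s"
    parity = "even" if a[1] == b[1] else "odd"
    return (kind, parity)


def psi_product(r, N):
    """Function type of prod_i f_i^{r_i} for error type r and sequence
    orders N, both listed from the innermost to the outermost layer."""
    out = C_EVEN  # identity element: f_i^0 = 1
    last = len(N) - 1
    for i, (r_i, N_i) in enumerate(zip(r, N)):
        if r_i == 0:
            continue
        if i == last:
            f_type = S_ODD          # outermost layer
        elif N_i % 2 == 0:
            f_type = C_EVEN         # inner layer, even order
        else:
            f_type = S_EVEN         # inner layer, odd order
        out = compose(out, f_type)
    return out
```

With these assumptions, `psi_product` depends on the error type only through $\oplus_k r_k [N_k]_2$ and $r_\ell$, which is the statement of the preceding paragraph.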



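error type $\vec r_\ell$          condition                                                      $\Psi(\prod_{i=1}^{\ell} f_i(\theta)^{r_i})$
(1) $(\vec v^0_{\ell-1}, 0)$      $\oplus_{k=1}^{\ell-1} r_k [N_k]_2 = 0$, $r_\ell = 0$          $c^{N_\ell,0}_{\rm even}$
(2) $(\vec v^0_{\ell-1}, 1)$      $\oplus_{k=1}^{\ell-1} r_k [N_k]_2 = 0$, $r_\ell = 1$          $s^{N_\ell,0}_{\rm odd}$
(3) $(\vec v^1_{\ell-1}, 0)$      $\oplus_{k=1}^{\ell-1} r_k [N_k]_2 = 1$, $r_\ell = 0$          $s^{N_\ell,0}_{\rm even}$
(4) $(\vec v^1_{\ell-1}, 1)$      $\oplus_{k=1}^{\ell-1} r_k [N_k]_2 = 1$, $r_\ell = 1$          $c^{N_\ell,0}_{\rm odd}$

TABLE I. The first column classifies the $2^\ell$ $\vec r_\ell$-type errors into four groups by two values, $\oplus_{k=1}^{\ell-1} r_k [N_k]_2$ and $r_\ell$, where $\vec v^0_{\ell-1}$ and $\vec v^1_{\ell-1}$ are defined in Eq. (49). The last column shows the function types of the Fourier expansion of the $\vec r_\ell$-type error modulation functions.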



Note that for the $\ell$-layer NUDD with $[N_i]_2 = 0$ $\forall i \le \ell-1$, $\oplus_{k=1}^{\ell-1} r_k [N_k]_2$ is zero for all $2^\ell$ $\vec r_\ell$-type errors. Therefore, $\Psi(\prod_{i=1}^{\ell} f_i(\theta)^{r_i})$ for all error types in this case is either $s^{N_\ell,0}_{\rm odd}$ or $c^{N_\ell,0}_{\rm even}$, the first two rows in Table I. On the other hand, for the $\ell$-layer NUDD with at least one UDD layer with odd sequence order, there are four function types, as shown in Table I.

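The following expression of $F^{(n)}_{\vec r_\ell}$ focuses on the function types of the integrands, while the coefficients in their Fourier expansion are unimportant in the proof,

$F^{(n)}_{\vec r_\ell} = \bigcirc_{p=1}^{n} \int_0^{\theta^{(p+1)}} d\theta^{(p)}\, \Psi\big(\prod_{i=1}^{\ell} f_i(\theta^{(p)})^{r_i^{(p)}}\big) \circ \Psi\big(G_1(\theta^{(p)})\big).$ (70)

As a matter of fact, due to

$\Psi\big(\prod_{i=1}^{\ell} f_i(\theta)^{r_i^{(1)}}\big) \circ \Psi\big(\prod_{i=1}^{\ell} f_i(\theta)^{r_i^{(2)}}\big) = \Psi\big(\prod_{i=1}^{\ell} f_i(\theta)^{r_i^{(1)} \oplus r_i^{(2)}}\big),$ (71)

$\Psi$ is a homomorphism from the $Z_2^{\otimes \ell}$ group $\{\prod_{i=1}^{\ell} f_i(\theta)^{r_i}\}$ to either the $Z_2$ group $\{c^{N_\ell,0}_{\rm even}, s^{N_\ell,0}_{\rm odd}\}$ or the $Z_2 \times Z_2$ group $\{c^{N_\ell,0}_{\rm even}, s^{N_\ell,0}_{\rm odd}, s^{N_\ell,0}_{\rm even}, c^{N_\ell,0}_{\rm odd}\}$.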
F. Integrating out the first $n-1$ integrals of $F^{(n)}_{\vec r_\ell}$



coefficients, i.e. 



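We shall prove the following lemma,

Lemma 3. For all $n \le \Lambda + 1$, the form of $F^{(n)}_{\vec r_\ell}$ after $n-1$ integrations becomes

$F^{(n)}_{\vec r_\ell} = \int_0^{\pi} d\theta^{(n)}\, \Psi\big(\prod_{i=1}^{\ell} f_i(\theta^{(n)})^{r_i}\big) \circ \Psi\big(G_n(\theta^{(n)})\big)$ (72)

up to different unimportant coefficients in the Fourier expansion, where $\Psi(G_n(\theta^{(n)})) = s^{N_\ell,n}_{\rm even}$. With $\Psi(\prod_{i=1}^{\ell} f_i(\theta^{(n)})^{r_i})$ given explicitly in Table I, all the possible function types of the resulting integrands of $F^{(n)}_{\vec r_\ell}$, $\Psi(\prod_{i=1}^{\ell} f_i(\theta^{(n)})^{r_i}) \circ \Psi(G_n(\theta^{(n)}))$ in Eq. (72), are listed in Table II.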


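error type                        the resulting integrand of $F^{(n)}_{\vec r_\ell}$      $\Lambda_{\vec r_\ell}$
(1) $(\vec v^0_{\ell-1}, 0)$      $s^{N_\ell,n}_{\rm even}$                               $\infty$
(2) $(\vec v^0_{\ell-1}, 1)$      $c^{N_\ell,n}_{\rm odd}$                                $N_\ell$
(3) $(\vec v^1_{\ell-1}, 0)$      $c^{N_\ell,n}_{\rm even}$                               $\tilde N_{\vec r_{\ell-1} \in \vec v^1_{\ell-1}}$
(4) $(\vec v^1_{\ell-1}, 1)$      $s^{N_\ell,n}_{\rm odd}$                                $\infty$

TABLE II. The second column shows the function type of the resulting integrand of $F^{(n)}_{\vec r_\ell}$ after $n-1$ integrations. The third column shows the maximum order $\Lambda_{\vec r_\ell}$ up to which the function type of the resulting integrand of $F^{(n)}_{\vec r_\ell}$ does not contain a constant term.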



$\Lambda_{\vec r_\ell}$ is defined as the maximum order such that the function type of the resulting integrand of $F^{(n)}_{\vec r_\ell}$ does not contain any constant term, and

$\Lambda = \min[\{\Lambda_{\vec r_\ell}\}].$ (73)



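The proof of Lemma 3 is done by induction. It is trivial that Lemma 3 is true for the first order $F^{(1)}_{\vec r_\ell}$, based on the form of Eq. (70). Suppose Lemma 3 holds for all the $(n-1)$th order NUDD coefficients where $n-1 \le \Lambda$, i.e.,

$F^{(n-1)}_{\vec r_\ell} = \int_0^{\pi} d\theta^{(n-1)}\, \Psi\big(\prod_{i=1}^{\ell} f_i(\theta^{(n-1)})^{r_i}\big) \circ \Psi\big(G_{n-1}(\theta^{(n-1)})\big),$ (74)

where $\Psi(\prod_{i=1}^{\ell} f_i(\theta^{(n-1)})^{r_i}) \circ \Psi(G_{n-1}(\theta^{(n-1)}))$ belongs to one of $\{c^{N_\ell,n-1}_{\rm even}, s^{N_\ell,n-1}_{\rm odd}, s^{N_\ell,n-1}_{\rm even}, c^{N_\ell,n-1}_{\rm odd}\}$.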

To proceed to the next order, first compare the forms 
of Eq. (70) between the n th order and the (n — l) th or- 
der NUDD coefficients. One can see that the n th order 
NUDD coefficients can actually be viewed as one inte- 
gral nested with one order lower [(n — 1)* -order] NUDD 



( d9 {n) y(f]fi(0 in) ) r < n) )o^{G 1 (e in) ))oF < : r ; 



(75) 



where re 



© f e implies that r e could be a different 



vector from 77, and the extra superscript 9^ of Fz? 
indicates that the upper integration limit, w, of the last 
integral of i^™ -1 - 1 is replaced by 9^ n \ Therefore, we can 



just substitute the results of Fp ^ [Eq. (74)] directly 



into Eq. (|75|) to evaluate Fi 



(») 



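From the definition of $\Lambda$, the resulting integrands of $F^{(n-1)}_{\vec r'_\ell}$ for all possible $\vec r'_\ell$ are guaranteed to contain no constant term for $n-1 \le \Lambda$. Accordingly, it is straightforward to check that the operation

$\Psi\big(G_1(\theta^{(n)})\big) \circ \int_0^{\theta^{(n)}} d\theta^{(n-1)}$ (76)

maps $\{c^{N_\ell,n-1}_{\rm even}, s^{N_\ell,n-1}_{\rm odd}, s^{N_\ell,n-1}_{\rm even}, c^{N_\ell,n-1}_{\rm odd}\}$ to its corresponding $\{c^{N_\ell,n}_{\rm even}, s^{N_\ell,n}_{\rm odd}, s^{N_\ell,n}_{\rm even}, c^{N_\ell,n}_{\rm odd}\}$, regardless of the unimportant change of the coefficients in the linear combination. Therefore, up to different coefficients in the linear combination, the operation Eq. (76) gives rise to the following mapping,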



— ► 



^(UMd^-'Y') o *(G„_ 1 (0("- 1 ))) 

i=l 

I 

Hl[fi(0 (n Y l ) o *(G„(0 (n) )). (77) 



(n) 

integrand of F~ becomes 



By substituting Eq. (77) into Eq (75), the resulting 



i=l i=l 

(78) 

Applying the homomorphism property Eq. (71) to the above equation, we obtain Eq. (72) for the $n$th order, where $n \le \Lambda + 1$.

For the order $n = \Lambda + 1$, the resulting integrands of $F^{(\Lambda+1)}_{\vec r_\ell}$ for some of the errors begin to contain a constant term. Then the operation Eq. (76) will map them to functions other than a purely cosine series or a purely sine series. Hence, it follows that Eq. (72) and the second column of Table II are no longer true for the $(\Lambda + 2)$th or higher order NUDD coefficients.
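Now let us prove the claimed values of $\Lambda_{\vec r_\ell}$ shown in the last column of Table II. By Definition 1, $c^{N_\ell,n}_{\rm odd}$, which is the function type of the resulting integrand of the $(\vec v^0_{\ell-1}, 1)$-type error NUDD coefficients in Table II, does not contain a cosine function with zero argument (a constant term) when $n \le N_\ell$. Therefore, $\Lambda_{(\vec v^0_{\ell-1},1)} = N_\ell$. Neither $s^{N_\ell,n}_{\rm even}$ nor $s^{N_\ell,n}_{\rm odd}$ ever has a constant term, which indicates that $\Lambda_{(\vec v^0_{\ell-1},0)} = \Lambda_{(\vec v^1_{\ell-1},1)} = \infty$. Only for the $(\vec v^1_{\ell-1}, 0)$-type error is $c^{N_\ell,n}_{\rm even}$ in general (by definition) allowed to have a cosine function with zero argument, i.e., a constant term. However, in Sec. IV C we already showed that $F^{(n)}_{\vec r_\ell} = 0$ for all $n \le \tilde N_{\vec r_{\ell-1}}$ [Eq. (46)]. Hence, with Eq. (48),

$F^{(n)}_{(\vec r_{\ell-1} \in \vec v^1_{\ell-1},\,0)} = \int_0^{\pi} c^{N_\ell,n}_{\rm even}\, d\theta = 0 \quad \forall n \le \tilde N_{\vec r_{\ell-1} \in \vec v^1_{\ell-1}}$ (79)

suggests that $c^{N_\ell,n}_{\rm even}$ in Eq. (79) has no constant term for the first $\tilde N_{\vec r_{\ell-1} \in \vec v^1_{\ell-1}}$ orders. Accordingly,

$\Lambda_{(\vec v^1_{\ell-1},0)} = \tilde N_{\vec r_{\ell-1} \in \vec v^1_{\ell-1}}.$ (80)

Due to

Lemma 4. $\min[\{\tilde N_{\vec r_{\ell-1} \in \vec v^1_{\ell-1}}\}] = N^{k<\ell}_{o_{\min}}$,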



G. The vanishing of the $(\vec v^0_{\ell-1}, 1)$-type error

According to Lemma 3, for the $(\vec v^0_{\ell-1}, 1)$-type error, $F^{(n)}_{(\vec v^0_{\ell-1},1)} = \int_0^{\pi} c^{N_\ell,n}_{\rm odd}\, d\theta$, where $c^{N_\ell,n}_{\rm odd}$ does not contain any constant term, for all $n \le \min[\Lambda + 1, N_\ell]$. Here $n \le \Lambda + 1$ ensures that the function type of the resulting integrand of $F^{(n)}_{(\vec v^0_{\ell-1},1)}$ is $c^{N_\ell,n}_{\rm odd}$, and $n \le \Lambda_{(\vec v^0_{\ell-1},1)} = N_\ell$ ensures that there is no constant term in it. Then, as shown in Eq. (50), it follows that

$F^{(n)}_{(\vec v^0_{\ell-1},1)} = 0 \quad \text{for } n \le \min[\Lambda + 1, N_\ell],$ (81)

where the value of $\Lambda$, expressed explicitly in terms of the sequence orders, shall be determined as follows.

For the NUDD with $[N_i]_2 = 0$ for all $i < \ell$, $\oplus_{k=1}^{\ell-1} r_k [N_k]_2 = 0$ is always true for all errors, so there are only two error types, $(\vec v^0_{\ell-1}, 0)$- and $(\vec v^0_{\ell-1}, 1)$-type errors, in this case. Therefore, from the definition of $\Lambda$ [Eq. (73)], we have

$\Lambda = \min[\Lambda_{(\vec v^0_{\ell-1},0)}, \Lambda_{(\vec v^0_{\ell-1},1)}] = \min[\infty, N_\ell] = N_\ell,$ (82)

which leads to $\min[N_\ell + 1, N_\ell] = N_\ell$ in Eq. (81). Therefore, we have proven that

$F^{(n)}_{(\vec v^0_{\ell-1},1)} = 0 \quad \forall n \le N_\ell$ (83)

for the NUDD with all even inner sequence orders.
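For NUDD with at least one odd sequence order in the first $\ell-1$ UDD layers, from the definition of $\Lambda$, Eq. (73),

$\Lambda = \min[\Lambda_{(\vec v^0_{\ell-1},0)}, \Lambda_{(\vec v^0_{\ell-1},1)}, \Lambda_{(\vec v^1_{\ell-1},0)}, \Lambda_{(\vec v^1_{\ell-1},1)}]$
$\quad = \min[\infty, N_\ell, \{\tilde N_{\vec r_{\ell-1} \in \vec v^1_{\ell-1}}\}, \infty]$
$\quad = \min[N_\ell, \{\tilde N_{\vec r_{\ell-1} \in \vec v^1_{\ell-1}}\}].$ (84)

With Lemma 4, which we prove in Appendix E, Eq. (84) reads

$\Lambda = \min[N_\ell, N^{k<\ell}_{o_{\min}}],$ (85)

which gives rise to

$\min[\Lambda + 1, N_\ell] = \min[\min[N_\ell, N^{k<\ell}_{o_{\min}}] + 1, N_\ell] = \min[N^{k<\ell}_{o_{\min}} + 1, N_\ell]$ (86)

in Eq. (81). Therefore, we have proven that

$F^{(n)}_{(\vec v^0_{\ell-1},1)} = 0 \quad \forall n \le \min[N^{k<\ell}_{o_{\min}} + 1, N_\ell]$ (87)

for NUDD with at least one odd inner sequence order.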



Combining Eqs. (83) and (87) with the fact that the vanishing of the other error types cannot be deduced from this approach due to their function types (shown in Table II), we obtain $F^{(n)}_{\vec r_\ell} = 0$ for all $n \le r_\ell\,[p_\oplus(1,\ell-1) \oplus 1]\,\tilde N_\ell$ [Eq. (51)], where $\tilde N_\ell = N_\ell$ for NUDD with all even inner sequence orders, and $\tilde N_\ell = \min[N^{k<\ell}_{o_{\min}} + 1, N_\ell]$ for NUDD with at least one odd inner sequence order, which is equivalent to the definition given in Eq. (30). This completes the proof of the NUDD Theorem.
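The effective suppression order of a single layer, as stated in the last paragraph, is simple enough to tabulate. The sketch below is an illustration only: the function name `effective_order` is hypothetical, and it assumes that $N^{k<\ell}_{o_{\min}}$ denotes the smallest odd sequence order among the inner layers, a reading that is consistent with the worked values $\min[6, 1+1] = 2$ and $\min[N_1 + 1, N_j] = 2$ quoted in Sec. V E.

```python
def effective_order(N, layer):
    """Effective suppression order of the given layer (1-indexed) for
    sequence orders N = (N_1, ..., N_l), listed innermost first.

    Assumption: N_omin is the smallest odd order among the inner layers.
    """
    inner_odd = [n for n in N[:layer - 1] if n % 2 == 1]
    if not inner_odd:
        # All inner sequence orders are even: the layer keeps its full order.
        return N[layer - 1]
    return min(min(inner_odd) + 1, N[layer - 1])
```

For example, `effective_order((2, 4, 1, 6), 4)` evaluates the case in which a small odd third layer limits the fourth layer.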



V. NUMERICAL RESULTS: 4-LAYER NUDD 

In this section, numerical simulations are employed to examine the performance of a 4-layer NUDD scheme for two contrasting MOOS. Given the UDD sequence order set $\{N_j\}_{j=1}^{4}$, the proof given above ensures a minimum suppression of all non-trivial error types to at least $N = \min[N_1, N_2, N_3, N_4]$, resulting in $U(T) \sim O(T^{N+1})$. We reconcile our analytical results with numerical simulations to confirm the predicted scaling of $U(T)$ and show that this scaling is indeed MOOS-independent. Furthermore, we analyze the scaling of all $2^4$ individual error types to convey the validity of the decoupling order formula Eq. (29).



A. Model 

We consider a 2-qubit system coupled to a generic 
quantum bath to analyze the performance of the 4-layer 
NUDD. The total Hamiltonian, which includes a pure- 
bath term and system-bath interactions, is given by 



where $\sigma^{\lambda_j}_j$ are the standard Pauli matrices with $\sigma^0_j = I$ for the $j = 1, 2$ system qubits, $B_{\lambda_1\lambda_2}$ are arbitrary bath operators with $\|B_{\lambda_1\lambda_2}\| = 1$, and $J_{\lambda_1\lambda_2}$ are bounded coupling coefficients between the qubits and the bath.

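Modeling the environment as a spin bath consisting of four spin-1/2 particles with randomized couplings between them, the operator $B_{\lambda_1\lambda_2}$ is given by

$B_{\lambda_1\lambda_2} = \sum_{\lambda_3,\lambda_4,\lambda_5,\lambda_6} c^{\lambda_3\lambda_4\lambda_5\lambda_6}_{\lambda_1\lambda_2}\, \sigma^{\lambda_3} \otimes \sigma^{\lambda_4} \otimes \sigma^{\lambda_5} \otimes \sigma^{\lambda_6},$ (89)

where the interactions are non-restrictive, i.e., 1- to 4-local interactions are permitted. The index $\lambda_j \in \{0, x, y, z\}$, where $j = 3, 4, 5, 6$ labels the bath qubits, and the coefficients $c^{\lambda_3\lambda_4\lambda_5\lambda_6}_{\lambda_1\lambda_2} \in [0, 1]$ are chosen randomly from a uniform distribution.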

We focus on the case of uniform coupling, $J_{\lambda_1\lambda_2} = J$ $\forall \lambda_j$ except $\lambda_1 = \lambda_2 = 0$, where we discriminate between the strength of the system-bath interactions and the pure bath dynamics $J_{00}$. For all simulations, $J = 1$ MHz and $J_{00} = 1$ kHz are considered, so that $J_{00} \ll J$. In this particular regime, DD has been shown to be the most beneficial since the environment dynamics are effectively static with respect to system-environment interactions [551110].



B. Overall decoupling order 

The overall performance of NUDD is quantified with respect to the state-independent distance measure

$D = \frac{1}{\sqrt{d_S d_B}} \min_{\Phi} \| U(T) - I \otimes \Phi \|_F,$ (90)

where $d_S$ is the system Hilbert space dimension, $d_B$ is the environment dimension, and $\Phi$ is a bath operator [44]. The minimum order scaling of NUDD is expected to be $D \sim O[(JT)^{N+1}]$ for a total sequence duration $T$. Therefore, the overall numerical decoupling order of the NUDD scheme, $n_{\min}$, is obtained by



(]og 10 D)-l 



(91) 



In the subsequent simulations, the scaling of $D$ is extracted by varying the minimum pulse delay $\tau = T\, s_{j_\ell=1} s_{j_{\ell-1}=1} \cdots s_{j_2=1} s_{j_1=1}$ instead of the total time $T$. We utilize $\tau$ since this quantity is usually the most relevant experimental time constraint.
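Reading a decoupling order off a log-log plot amounts to a straight-line fit. The sketch below illustrates this on synthetic data only; the function name, the prefactor, and the range of $\tau$ are assumptions made for the example and are not the simulation data of this paper. It fits $\log_{10} D$ against $\log_{10}(J\tau)$ by ordinary least squares and subtracts one from the slope, in line with the expected scaling $D \sim O[(J\tau)^{n_{\min}+1}]$.

```python
import math


def fit_decoupling_order(taus, distances, J=1.0):
    """Least-squares slope of log10(D) versus log10(J*tau), minus one."""
    xs = [math.log10(J * tau) for tau in taus]
    ys = [math.log10(d) for d in distances]
    n = len(xs)
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    slope = (sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
             / sum((x - x_bar) ** 2 for x in xs))
    return slope - 1.0


# Synthetic example: D = 3 * (J*tau)^(N+1) with N = 2 (illustrative values).
example_taus = [10.0 ** k for k in range(-10, -1)]
example_D = [3.0 * tau ** 3 for tau in example_taus]
example_order = fit_decoupling_order(example_taus, example_D)
```

In practice the fit window has to stay inside the regime where the leading power dominates, which is why a restricted range of $\log_{10} J\tau$ is used for the interpolation.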



C. $\vec r_4 = (r_1, r_2, r_3, r_4)$-type error



Given a MOOS, the general 2-qubit Hamiltonian (88) can be partitioned into $2^4$ error types $H_{\vec r_4} = H_{(r_1,r_2,r_3,r_4)}$ [Eq. ffify] via the procedure discussed in Appendix [XJ. We also note that it is possible to generate $\{H_{(r_1,r_2,r_3,r_4)}\}$ using $H_{(1,0,0,0)}$, $H_{(0,1,0,0)}$, $H_{(0,0,1,0)}$, and $H_{(0,0,0,1)}$ as the generators of the $Z_2^{\otimes 4}$ group, where the complete set of error types can be obtained by all possible products of the error generators: $H_{\vec r_4^{(1)}} H_{\vec r_4^{(2)}} = H_{\vec r_4^{(1)} \oplus \vec r_4^{(2)}}$. We explicitly display the 16 error types in Table III, where each error type is generated by a product of the corresponding outer-most column and row elements.





                  $H_{(0,0,0,0)}$   $H_{(0,0,1,0)}$   $H_{(0,0,0,1)}$   $H_{(0,0,1,1)}$
$H_{(0,0,0,0)}$   $H_{(0,0,0,0)}$   $H_{(0,0,1,0)}$   $H_{(0,0,0,1)}$   $H_{(0,0,1,1)}$
$H_{(1,0,0,0)}$   $H_{(1,0,0,0)}$   $H_{(1,0,1,0)}$   $H_{(1,0,0,1)}$   $H_{(1,0,1,1)}$
$H_{(0,1,0,0)}$   $H_{(0,1,0,0)}$   $H_{(0,1,1,0)}$   $H_{(0,1,0,1)}$   $H_{(0,1,1,1)}$
$H_{(1,1,0,0)}$   $H_{(1,1,0,0)}$   $H_{(1,1,1,0)}$   $H_{(1,1,0,1)}$   $H_{(1,1,1,1)}$

TABLE III. 16 error types $\vec r_4$ for the 4-layer NUDD scheme.

In order to illustrate the procedure by which the error types are generated, we consider the two MOOS sets utilized to convey the unbiased nature of the minimum scaling for NUDD. The first,

$\{I \otimes \sigma_z,\ I \otimes \sigma_x,\ \sigma_z \otimes I,\ \sigma_x \otimes I\},$ (92)

is composed of single-qubit operators with the corresponding error generators

$H_{(1,0,0,0)} = I \otimes \sigma_x, \quad H_{(0,1,0,0)} = I \otimes \sigma_z, \quad H_{(0,0,1,0)} = \sigma_x \otimes I, \quad H_{(0,0,0,1)} = \sigma_z \otimes I,$ (93)

up to arbitrary bath operators, which are omitted in the above equations. Substituting Eq. (93) into Table III, we obtain the explicit forms for all error types, as shown in Table IV.

                  $H_{(0,0,0,0)}$            $H_{(0,0,1,0)}$                 $H_{(0,0,0,1)}$                 $H_{(0,0,1,1)}$
$H_{(0,0,0,0)}$   $I \otimes I$              $\sigma_x \otimes I$            $\sigma_z \otimes I$            $\sigma_y \otimes I$
$H_{(1,0,0,0)}$   $I \otimes \sigma_x$       $\sigma_x \otimes \sigma_x$     $\sigma_z \otimes \sigma_x$     $\sigma_y \otimes \sigma_x$
$H_{(0,1,0,0)}$   $I \otimes \sigma_z$       $\sigma_x \otimes \sigma_z$     $\sigma_z \otimes \sigma_z$     $\sigma_y \otimes \sigma_z$
$H_{(1,1,0,0)}$   $I \otimes \sigma_y$       $\sigma_x \otimes \sigma_y$     $\sigma_z \otimes \sigma_y$     $\sigma_y \otimes \sigma_y$

TABLE IV. The 16 error types $\vec r_4$ for the 4-layer NUDD scheme with the MOOS $\{I \otimes \sigma_z, I \otimes \sigma_x, \sigma_z \otimes I, \sigma_x \otimes I\}$.

The second MOOS is chosen similarly as

$\{\sigma_z \otimes \sigma_z,\ I \otimes \sigma_x,\ \sigma_z \otimes I,\ \sigma_x \otimes I\},$ (94)

where we replace the $I \otimes \sigma_z$ operator with a two-qubit $\sigma_z \otimes \sigma_z$ operator. The error generators for this MOOS, up to arbitrary bath operators, are

$H_{(1,0,0,0)} = I \otimes \sigma_x, \quad H_{(0,1,0,0)} = I \otimes \sigma_z, \quad H_{(0,0,1,0)} = \sigma_x \otimes \sigma_x, \quad H_{(0,0,0,1)} = \sigma_z \otimes I,$ (95)

and the error types are given in Table V.
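The assignment of an error type to each two-qubit Pauli operator can be reproduced with a short commutation check. The sketch below is illustrative and its names are hypothetical: Pauli operators are written as two-character strings (left character for the first tensor factor), bath operators and phases are ignored, and $r_i = 1$ is taken to mean that the operator anticommutes with the $i$th MOOS element.

```python
from itertools import product


def anticommute(p, q):
    """True if two Pauli strings such as 'XZ' and 'ZI' anticommute."""
    clashes = sum(1 for a, b in zip(p, q) if a != "I" and b != "I" and a != b)
    return clashes % 2 == 1


def error_type(pauli, moos):
    """Error type (r_1, ..., r_4): r_i = 1 iff pauli anticommutes with moos[i]."""
    return tuple(int(anticommute(pauli, omega)) for omega in moos)


def classify(moos):
    """Map every error type to the two-qubit Pauli string that carries it."""
    return {
        error_type("".join(p), moos): "".join(p)
        for p in product("IXYZ", repeat=2)
    }


MOOS_1 = ("IZ", "IX", "ZI", "XI")   # Eq. (92)
MOOS_2 = ("ZZ", "IX", "ZI", "XI")   # Eq. (94)
```

Calling `classify(MOOS_1)` or `classify(MOOS_2)` returns sixteen distinct error types, and the four entries with a single nonzero $r_i$ are the error generators for that MOOS.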

D. Decoupling order of each error type 

The decoupling order for each error type is extracted using a procedure equivalent to that of Ref. [40], where

$E_{\vec r_4} = \| \mathrm{Tr}_S (U(T) H_{\vec r_4}) \|_F$ (96)

represents the effective error for error type $H_{\vec r_4}$ present at the end of the DD evolution $U(T)$. Here, $\|A\|_F$ is the Frobenius norm of $A$,

$\|A\|_F = \sqrt{\mathrm{Tr}\, A^\dagger A},$ (97)

the square root of the sum of the squared singular values of $A$. (The choice of norm is somewhat arbitrary; we could have used any other unitarily invariant norm as well, e.g., the trace norm.)

TABLE V. The 16 error types $\vec r_4$ for the 4-layer NUDD scheme with the MOOS $\{\sigma_z \otimes \sigma_z, I \otimes \sigma_x, \sigma_z \otimes I, \sigma_x \otimes I\}$.

The expected scaling for each effective error type can be shown to follow

$E_{\vec r_4} \sim O[(JT)^{\tilde N_{\vec r_4}+1}] \sim O[(J\tau)^{\tilde N_{\vec r_4}+1}],$ (98)

where $\tilde N_{\vec r_4}$ is given by Eq. (29). Therefore, the numerical decoupling order of the $\vec r_4$-type error, $n_{\vec r_4}$, is obtained by



n fi = (log 10 £V 4 ) - 1 



(99) 



E. Comparison of analytical predictions with numerical results



In order to convey the MOOS-independent scaling of NUDD performance, we consider the case where the three inner-most layers contain the same number of pulses, i.e., $N_{123} = N_j$, $j = 1, 2, 3$, for both MOOS sets given above; see Eqs. (92) and (94). We choose the NUDD layering such that the right-most control pulse operator corresponds to the outer-most UDD sequence with sequence order $N_4$ and the left-most operator generates the inner-most UDD sequence with sequence order $N_1$. Fixing the sequence order for the outer-most layer, $N_4$, and varying $N_{123}$, a qualitative and quantitative similarity can be expected between the QDD analysis given in Ref. [2D] and the 4-layer NUDD construction considered here. Hence, the expected scaling of the distance measure described by Eq. (90) is $D \sim O[(J\tau)^{n_{\min}+1}]$, where $n_{\min} = \min[N_4, N_{123}]$.

In Fig. 1 the performance is shown as a function of the minimum pulse interval $\tau$ for both MOOS sets in the case of $N_4 = 3$ in (a) and (c), and $N_4 = 4$ in (b) and (d). The inner sequence orders are $N_{123} = 1, 2, \ldots, 6$ for all simulations, which are averaged over 15 random realizations of $B_{\lambda_1\lambda_2}$ using 120 digits of precision.

Comparing Figs. 1(a) and (c), we find that variations in the MOOS do not change the qualitative features of NUDD performance. Furthermore, we can confirm by linear interpolation that the slope of each curve indeed satisfies $n_{\min} = \min[N_{123}, N_4]$. Similar results are obtained for $N_4 = 4$ in Figs. 1(b) and (d) as well. By this comparison, we show that NUDD is a MOOS-independent construction and confirm this result for two specific even and odd parity values of $N_4$. We expect the scaling to remain consistent for general 4-layer NUDD, where the scaling becomes $n_{\min} = \min[N_1, N_2, N_3, N_4]$, and for more general multi-layer NUDD as well.

$\tilde N_{(0,0,0,0)}$: …     $\tilde N_{(0,0,1,0)}$: …     $\tilde N_{(0,0,0,1)}$: …     $\tilde N_{(0,0,1,1)}$: …
$\tilde N_{(1,0,0,0)}$: …     $\tilde N_{(1,0,1,0)}$: …     $\tilde N_{(1,0,0,1)}$: 8     $\tilde N_{(1,0,1,1)}$: 8
$\tilde N_{(0,1,0,0)}$: 4     $\tilde N_{(0,1,1,0)}$: 6     $\tilde N_{(0,1,0,1)}$: 8     $\tilde N_{(0,1,1,1)}$: 8
$\tilde N_{(1,1,0,0)}$: 4     $\tilde N_{(1,1,1,0)}$: 6     $\tilde N_{(1,1,0,1)}$: 8     $\tilde N_{(1,1,1,1)}$: 8

TABLE VI. For the case with $N_1 = 2$, $N_2 = 4$, $N_3 = 6$, $N_4 = 8$, analytical and numerical decoupling orders for all error types are in complete agreement. The analytical overall decoupling order $N_{\min} = \min[2, 4, 6, 8] = 2$ also agrees with the actual overall decoupling order.

The scaling for each error type is characterized as a function of the minimum pulse interval using Eq. (96), specifically for the MOOS given by Eq. (92). For each error type, we display the decoupling order in tabular form, following the format described in Sec. V C. The numerically obtained decoupling order is denoted in black, the analytical prediction from Eq. (29) in blue, and the "naive" decoupling order in red. If a blue (red) number is absent, it means our analytical decoupling order (naive decoupling order) is exactly the same as the actual decoupling order. We refer to

$N_{\vec r_\ell} = \max[\{r_i N_i\}_{i=1}^{\ell}]$ (100)

as the "naive" decoupling order since it assumes that each nested layer acts independently. A previous analysis for QDD has shown that inter-layer interference may alter the expected UDD efficiency even when the layers do not provide decoupling for the same error type [40]. We expect a similar characteristic for NUDD as well and seek to identify the conditions when the naive decoupling order is incorrect.
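The naive decoupling order of Eq. (100) is straightforward to evaluate for every error type. The following sketch uses hypothetical function names and simply applies the maximum over layers; it says nothing about the inter-layer interference that Eq. (29) accounts for.

```python
from itertools import product


def naive_order(r, N):
    """Naive decoupling order of Eq. (100): max over layers of r_i * N_i."""
    return max(r_i * N_i for r_i, N_i in zip(r, N))


def naive_table(N):
    """Naive decoupling order for every non-trivial error type r."""
    return {
        r: naive_order(r, N)
        for r in product((0, 1), repeat=len(N))
        if any(r)
    }
```

For the all-even sequence orders $(2, 4, 6, 8)$, `naive_table` gives 4 for the $(1,1,0,0)$-type error and 6 for the $(0,1,1,0)$-type error, and its smallest entry is 2.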

FIG. 1. Performance of NUDD as a function of the minimum pulse interval $\tau$ for the two-qubit system specified by Eqs. (88) and (89), averaged over 15 random realizations of $B_{\lambda_1\lambda_2}$. Error bars are included, but are very small. A four-layer construction is utilized to address all system-bath interactions, where the MOOS sets are chosen as Eq. (92) in (a) and (b), and Eq. (94) in (c) and (d). The orders of the first three layers are identical, $N_{123} = N_j$, $j = 1, 2, 3$, and are varied as $N_{123} = 1, 2, \ldots, 6$. In particular, the outer layer sequence order is chosen as $N_4 = 3$ in (a) and (c), and $N_4 = 4$ in (b) and (d). Linear interpolation from $\log_{10} J\tau = -10$ to approximately $\log_{10} J\tau = -2$ confirms the MOOS-independent scaling $D \sim O[(J\tau)^{n_{\min}+1}]$, $n_{\min} = \min\{N_j\}_{j=1}^{4}$, for all simulations.

Tables VI, VII, and VIII contain typical examples where our analytically predicted decoupling orders for all error types exactly match those obtained numerically. Note that in these particular NUDD schemes, the suppression order of each UDD layer is equal to its sequence order, i.e., $\tilde N_i = N_i$. In Table VI all sequence orders are even parity, i.e., $[N_i]_2 = 0$ $\forall i$. Specifically, we consider $N_1 = 2$, $N_2 = 4$, $N_3 = 6$, and $N_4 = 8$, although we expect equivalent results for arbitrary even parity $N_j$. According to Eq. (29), the decoupling order formula for such cases coincides with the naive decoupling order formula Eq. (100); interference between layers



is not observed. The analytical overall decoupling order $N = \min[2, 4, 6, 8] = 2$ is found to agree with the actual overall decoupling order as well.

The second example we consider, Table VII, is one of the cases with all even sequence orders except the outer-most UDD layer. Analytical and numerical decoupling orders (black) are in complete agreement for all error types. There is one order difference between the naive decoupling order (red) and the actual decoupling order (black) in the last column and the last two entries in the third column. From the decoupling order formula Eq. p6|, the difference comes from the fact that the outermost (4th) UDD layer with odd order 3 boosts the decoupling order for error types that are also addressed by one of the inner layers. Clearly, the deviation from the standard UDD scaling can be attributed to the asymmetry of the outer layer.



The third example, shown in Table VIII, is for all odd sequence orders such that $N_1 > N_2 > N_3 > N_4$ with $[N_i]_2 = 1$ for all $i$. The analytical decoupling order formula is given in Eq. (38). The second term in Eq. (38) is called the outer-odd-UDD suppression effect, which implies a direct relationship between the number of odd parity layers which address the error, and the additional orders of error suppression achievable for a particular error type. Comparing the analytical/numerical results with the naive decoupling order, we find that the increase in decoupling order is attributed to this outer-odd-UDD suppression effect.

Discrepancies between numerical results and analytical predictions occur when the inner UDD layers contain odd parity sequence orders that are smaller than the outer layers beyond the odd order. In Tables IX and X, we consider two cases where deviations from the analytical predictions occur. It is important to note that although the numerical and analytical decoupling orders differ, we
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^V(i. 1,1.1) 
6,5 



TABLE VII. For the case with $N_1 = 2$, $N_2 = 4$, $N_3 = 6$, $N_4 = 3$, the analytical and numerical decoupling orders (marked in black) are in complete agreement for all error types. The analytical overall decoupling order $N_{\min} = \min[2, 4, 6, 3] = 2$ also agrees with the actual overall decoupling order. The difference between the naive (marked in red) and the actual decoupling orders comes from the outer-odd-UDD suppression effect.

TABLE VIII. For the case with $N_1 = 7$, $N_2 = 5$, $N_3 = 3$, $N_4 = 1$, the analytical and numerical decoupling orders for all error types are in complete agreement. The analytical overall decoupling order $N_{\min} = \min[7, 5, 3, 1] = 1$ also agrees with the actual overall decoupling order. The difference between the naive (marked in red) and the actual decoupling orders (marked in black) comes from the outer-odd-UDD suppression effect.

TABLE IX. For the case with $N_1 = 2$, $N_2 = 4$, $N_3 = 1$, $N_4 = 6$, the analytical overall decoupling order $N_{\min} = \min[2, 4, 1, 6] = 1$ agrees with the actual overall decoupling order. However, the numerical decoupling order (marked in black) and our analytical predictions (marked in blue) have different values for error types in the third column. The naive decoupling order is marked in red.

TABLE X. For the case with $N_1 = 1$, $N_2 = 3$, $N_3 = 5$, $N_4 = 7$, the analytical overall decoupling order $N_{\min} = \min[1, 3, 5, 7] = 1$ agrees with the actual overall decoupling order. However, the numerical decoupling order (marked in black) and our analytical predictions (marked in blue) have different values for some error types. The naive decoupling order is marked in red.



do not predict error suppression beyond what is achieved. 
The formula given by Eq. (29) can be thought of as a 



lower bound for the actual decoupling order. Essentially, 
we are predicting the minimum order of error suppression 
for each error type. 

Table IX is an example where the third layer is the
only odd-parity layer; the sequence orders are chosen
specifically as $N_1 = 2$, $N_2 = 4$, $N_3 = 1$, $N_4 = 6$. Our
analytical prediction from Eq. (29) is

$$N_{\vec{r}_4} = \max\left[\{r_l N_l + r_3\}_{l=1}^{2},\; r_3 N_3,\; r_4\,(r_3 [N_3]_2 \oplus 1)\,\tilde{N}_4\right], \tag{101}$$

where the suppression ability of the fourth UDD layer,
$\tilde{N}_4 = \min[N_4, N_3 + 1]$, is hindered by the third UDD
layer if $N_3 < N_4 - 1$. For the particular sequence orders
we consider here, the analytical prediction is $\tilde{N}_4 =
\min[6, 1 + 1] = 2$. Comparing the analytical results to
numerical simulations, it appears that the analytical
decoupling orders (marked in blue) are lower than the
numerically calculated values for certain error types; see
column 3 in Table IX. However, despite the discrepancy,
Eq. (101) captures the inhibiting effect of the
odd-parity inner sequence order on the decoupling order
of the outer-most layer. As analytically predicted, the
outer-odd-UDD suppression effect is observed for the
error types in the second column, which anticommute with
the third UDD layer and its inner UDD layers, as seen by
comparing their naive and numerical decoupling orders. For the
error types with $r_3 = r_4 = 1$ in the fourth column of
Table IX, $r_3 [N_3]_2 \oplus 1 = 0$, so our analytical formula
Eq. (101) shows that the fourth UDD layer is totally
ineffective due to the odd sequence order of the third UDD
layer. Indeed, the numerical decoupling order for the
error types in the fourth column of Table IX is smaller than
6, the sequence order of the fourth UDD layer.
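
The lower bound discussed above can be evaluated mechanically. The following is a minimal sketch, not the authors' code: it assumes the form of Eq. (29) that can be read off from Eq. (101) and Eq. (C1), namely $N_{\vec{r}} = \max_i[r_i (p_\oplus(1,i-1) \oplus 1)\tilde{N}_i + p_+(i+1,\ell)]$ with $\tilde{N}_i = \min[N_i, N^{k<i}_{o_{\min}} + 1]$. The function names `effective_order` and `decoupling_order` are hypothetical.

```python
from itertools import product

def effective_order(N, i):
    """Order of layer i (0-based) after inhibition by inner odd-order layers."""
    inner_odd = [n for n in N[:i] if n % 2 == 1]
    return min(N[i], min(inner_odd) + 1) if inner_odd else N[i]

def decoupling_order(N, r):
    """Assumed lower bound on the decoupling order of error type r = (r_1..r_l)."""
    odd = [ri * (n % 2) for ri, n in zip(r, N)]  # r_k [N_k]_2 for each layer
    best = 0
    for i in range(len(N)):
        p_xor = sum(odd[:i]) % 2       # parity p_xor(1, i-1) over inner layers
        p_plus = sum(odd[i + 1:])      # count p_+(i+1, l) over outer layers
        term = r[i] * (p_xor ^ 1) * effective_order(N, i) + p_plus
        best = max(best, term)
    return best

# Sequence orders of the Table IX example, inner-most layer first.
N = (2, 4, 1, 6)
for r in product((0, 1), repeat=len(N)):
    if any(r):
        print(r, decoupling_order(N, r))
```

For $\vec{r} = (0,0,1,1)$ the fourth-layer term vanishes, which is the "totally ineffective" case described in the text, while $\vec{r} = (0,0,0,1)$ gives $\tilde{N}_4 = 2$.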

As a final example, we consider the sequence orders
$N_1 = 1$, $N_2 = 3$, $N_3 = 5$, and $N_4 = 7$. Since
$\tilde{N}_j = \min[N_1 + 1, N_j] = 2 < N_j$ for $j = 2, 3, 4$,
the suppression abilities of the outer UDD
layers are diminished due to the $N_1 = 1$ sequence order
of the inner-most UDD layer. Indeed, the numerical
decoupling orders are lower than the naive UDD scaling
for a substantial fraction of error types, indicating
interference between layers. However, we find that the
analytically predicted decoupling orders still differ from
the numerically obtained values. It appears that the actual
decoupling order is determined not only by the sequence
orders, but also by the error types, in a complicated
fashion not fully captured by the analytical decoupling
formula, which nevertheless continues to provide
a reliable lower bound on the decoupling order for each
error type, as discussed earlier.



VI. CONCLUSIONS 

The NUDD scheme, which nests multiple layers of
UDD sequences, is the most efficient scheme currently
known for general multi-qubit decoherence suppression,
assuming ideal pulses and a bath with a sharp cutoff or
bounded operator norm. In this work, we have given a
rigorous analysis, along with a compact formulation, of
the universality and performance of general NUDD
sequences with a MOOS as the control pulse set. We proved
that the overall suppression order of NUDD with general
control pulses is the minimum of all the sequence orders
of the individual UDD sequences comprising the NUDD
scheme. Moreover, we also obtained lower bounds on
the decoupling order of each type of error, given a
sequence order set. Our decoupling order formula Eq. (29)
shows that only for NUDD sequences with all even
sequence orders do all UDD layers work independently, i.e.,
the suppression ability of each UDD layer is unaffected
by the presence of the other layers. For all other NUDD
schemes, with at least one odd sequence order, interference
between UDD layers appears and can be
summarized as follows:

1. For a given UDD layer, say the $i$th UDD layer with
sequence order $N_i$, if there are inner layers with odd
sequence order and the lowest odd order is smaller
than $N_i - 1$, then the suppression ability of the $i$th
UDD layer is hindered by this inner UDD layer with
lowest odd order, i.e., it cannot achieve $N_i$th-order
decoupling. [$\tilde{N}_i = N^{k<i}_{o_{\min}} + 1 < N_i$ in Eq. (29).]

2. For the $i$th UDD layer to be effective against a given
error type, this error needs to not only anticommute
with the control pulses of the $i$th UDD layer
($r_i = 1$) but also anticommute with an even number
of UDD layers with odd sequence orders before
the $i$th UDD layer ($p_\oplus(1, i-1) = 0$).

3. For a given error type, if there is a total odd number
of UDD layers with odd sequence orders before the
$i$th UDD layer that the error anticommutes with
($p_\oplus(1, i-1) = 1$), then the $i$th UDD layer is totally
ineffective ($p_\oplus(1, i-1) \oplus 1 = 0$) irrespective
of whether this error anticommutes with this layer
or not.

4. For a given error type, each odd-order UDD layer
that the error anticommutes with and that is nested
outside the $i$th UDD layer can enhance the suppression
ability of the $i$th UDD layer by one more order
(outer-odd-UDD effect) on this error type. In other
words, the outer-odd-UDD suppression effect is cumulative
and is responsible for the $p_+(i+1, \ell)$ term
in the decoupling order formula Eq. (29).

Since our analysis identifies the conditions under which
the suppression ability of a given UDD layer is inhibited,
made totally ineffective, or enhanced by other
UDD layers with odd sequence orders, one can use it to
design an optimally ordered NUDD scheme that exploits
the full power of each UDD layer. To be more specific,
suppose one would like to design an NUDD scheme from
some UDD sequences whose control pulse types and
sequence orders are given. From the analysis presented
here, in order to reach optimal efficiency, one should
first nest all the UDD layers with even sequence orders
together, where the nesting order can be arbitrary, and
denote the resulting sequence as NUDD$_e$; second, nest
all the UDD layers with odd sequence orders together
such that the sequence orders decrease from the inner-most
to the outer-most layer, and denote the resulting
sequence as NUDD$_o$; the final and optimal NUDD
scheme is constructed by nesting NUDD$_e$ as the inner
sequence with NUDD$_o$ as the outer sequence.
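
The ordering rule just described can be sketched in a few lines. This is an illustrative simplification under stated assumptions: each UDD layer is represented only by its sequence order (the control pulse types are ignored), and the function name `optimal_nesting` is hypothetical.

```python
def optimal_nesting(orders):
    """Arrange UDD sequence orders from the inner-most to the outer-most layer.

    Even-order layers form the inner block (their relative order is
    arbitrary, so the input order is kept); odd-order layers form the
    outer block, with decreasing order toward the outer-most layer.
    """
    evens = [n for n in orders if n % 2 == 0]
    odds = sorted((n for n in orders if n % 2 == 1), reverse=True)
    return evens + odds

print(optimal_nesting([1, 6, 3, 2, 5]))  # [6, 2, 5, 3, 1]
```

With this arrangement no layer has an inner odd-order layer of order smaller than its own order minus one, so by item 1 above no layer is inhibited.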

An important challenge is to generalize the analysis 
we have presented here to the setting of non-ideal, finite 
width pulses.



Appendix A: Procedure to obtain $H_{(r_1,\ldots,r_\ell)}$



Starting with the control pulse operator composing the
first UDD layer, $\Omega_1$, the Hamiltonian can be partitioned
into

$$H = \sum_{r_1 = 0,1} H_{(r_1)}, \tag{A1}$$

where

$$H_{(r_1)} = \frac{H + (-1)^{r_1}\,\Omega_1 H \Omega_1}{2}. \tag{A2}$$

The Hamiltonian $H_{(0)}$ commutes with $\Omega_1$ while $H_{(1)}$
anticommutes with $\Omega_1$. Continuing the procedure for the
next layer of NUDD, each $H_{(r_1)}$, with $r_1 = 0$ or $1$, can
be divided further into commuting and anticommuting
parts for the control pulse operator $\Omega_2$ as

$$H_{(r_1, r_2)} = \frac{H_{(r_1)} + (-1)^{r_2}\,\Omega_2 H_{(r_1)} \Omega_2}{2}. \tag{A3}$$

Using Eq. (A2) iteratively for the first $\ell$ layers, we have

$$H_{(r_1,\ldots,r_\ell)} = \frac{H_{(r_1,\ldots,r_{\ell-1})} + (-1)^{r_\ell}\,\Omega_\ell H_{(r_1,\ldots,r_{\ell-1})} \Omega_\ell}{2}, \tag{A4}$$

which commutes with $\Omega_i$ if $r_i = 0$ and anticommutes with
$\Omega_i$ if $r_i = 1$, where $i \in \{1, \ldots, \ell\}$.
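
As an illustration of this partitioning, the sketch below applies the projection $[H + (-1)^{r}\,\Omega H \Omega]/2$ twice to a single-qubit Hamiltonian. The choices $\Omega_1 = Z$, $\Omega_2 = X$ and the numerical coefficients are assumptions made only for this example; the helpers `matmul`, `add`, `scale`, `split`, and `close` are hypothetical, and `split` assumes $\Omega^2 = I$.

```python
def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(a, b, sign=1):
    return [[x + sign * y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def scale(a, c):
    return [[c * x for x in row] for row in a]

def split(h, omega, r):
    """Return [h + (-1)^r omega h omega] / 2, assuming omega * omega = identity."""
    conj = matmul(matmul(omega, h), omega)
    return scale(add(h, conj, sign=(-1) ** r), 0.5)

def close(a, b, tol=1e-12):
    return all(abs(x - y) < tol for ra, rb in zip(a, b) for x, y in zip(ra, rb))

I2 = [[1, 0], [0, 1]]
X = [[0, 1], [1, 0]]
Y = [[0, -1j], [1j, 0]]
Z = [[1, 0], [0, -1]]

# Example Hamiltonian H = 0.3 I + 0.5 X - 0.7 Y + 1.1 Z (arbitrary numbers).
H = add(add(scale(I2, 0.3), scale(X, 0.5)), add(scale(Y, -0.7), scale(Z, 1.1)))

# First layer: Omega_1 = Z; second layer: Omega_2 = X.
parts = {}
for r1 in (0, 1):
    h1 = split(H, Z, r1)
    for r2 in (0, 1):
        parts[(r1, r2)] = split(h1, X, r2)

# Each component is proportional to a single Pauli operator.
print(close(parts[(0, 0)], scale(I2, 0.3)), close(parts[(0, 1)], scale(Z, 1.1)))
print(close(parts[(1, 0)], scale(X, 0.5)), close(parts[(1, 1)], scale(Y, -0.7)))
```

Here the label $(r_1, r_2)$ records whether a component commutes (0) or anticommutes (1) with $Z$ and with $X$, respectively, and the four components sum back to $H$.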



Appendix B: The outer-most UDD interval 
decomposition 



We shall derive Eq. (53) by splitting each integral
of $F_{\oplus_{p=1}^{n} \vec{r}_\ell^{(p)}}$ [Eq. (21)] into a sum of sub-integrals over
the normalized outer-most layer intervals $s_{j_\ell}$. Since
$F_{\oplus_{p=1}^{n} \vec{r}_\ell^{(p)}}$ comprises a series of time-ordered nested
integrals, our procedure for decomposing it is to
evaluate each nested integral one by one, from $\eta^{(n)}$ to
$\eta^{(1)}$.

We call the sub-integral over the $j_\ell$th outer-most
interval "sub-integral-$j_\ell$". Suppose the integral of the
integration variable $\eta^{(p)}$ follows the sub-integral-$j_\ell^{(p+1)}$ of the
previous variable $\eta^{(p+1)}$. By splitting up the integral of
$\eta^{(p)}$ with respect to the normalized outer-most intervals
$s_{j_\ell}$, we have

/ UMv^r* d V M (Bl) 

j (p+1) -i i-i 

e /^ty p> P 5 n/ 4 (i7 w )-« w *» w 

8 — 1 3 f 



,(p+l) £-1 



JlAp+i) , i=l 

3/ 1 



where $\eta_{j_\ell^{(p)}}$ is the UDD$_{N_\ell}$ pulse timing and $f_\ell(j_\ell^{(p)}) =
(-1)^{j_\ell^{(p)} - 1}$.



for Eq. (B3b|. In this notation, Eq. (B3a| becomes 

/ n^ w ) p * w<6 » Wx 

j< p+i) -(« P ei) 

E ftU^p W s jW , (B4) 
where /-(?7 (p) ) = fi(v {p) ), and Eq. p3§ with £ p = 1 



becomes 



i=l 

j1 P+1) -(« p 01) 



where 



j< p+1) -(«pei) 




E 


- E 




.(p)_.-(p+i) 



which means that = j'j P+1 ^. 



(B5) 



(B6) 



For each sub-integral with the integration domain
$[\eta_{j_\ell^{(p)}-1}, \eta_{j_\ell^{(p)}})$, with pulse interval $s_{j_\ell^{(p)}}$, we make the
following linear change of variable,

$$\tilde{\eta}^{(p)} = \frac{\eta^{(p)} - \eta_{j_\ell^{(p)}-1}}{s_{j_\ell^{(p)}}}, \tag{B2}$$

to normalize the integration domain to 1. Accordingly,
each $f_i(\tilde{\eta}^{(p)})$ with $i \le \ell - 1$ becomes the normalized $i$th-layer
modulation function for $\ell - 1$ layers of NUDD, and is
the same function for all of the outer-most pulse intervals.



Consequently the summands in Eq. (Bl) become 



Jo i=1 ,p )=1 



(p) 



sa P ) (B3a) 



;(P+1) 



(B3b) 



where f~ Yli=i fiiv^Y^ dfj^ is taken out of the sum- 
mation. 



It is possible to rewrite Eq. (B3a) and Eq. (B3b) in
similar form by introducing the configuration number $\xi_p$,
where $\xi_p = 0$ stands for Eq. (B3a) while $\xi_p = 1$ stands


Note that both forms of Eqs. (B4) and (B5) are naturally
decomposed into an inner part, which is an integral
of the first $\ell - 1$ modulation functions for $(\ell - 1)$-layer
NUDD, and an outer part, which is a sum over the
outer-most ($\ell$th) layer modulation function.



Each integral of F-. n „( P ) can be split into Eqs. (B4) 
®p=i r t ' — ' 

and (B5), with one exception: if = 1 the sub- 

sequent sub-integrals from to vf- 1 ' will only contain 
Eq. (B5|, whose configuration number is 1. 



By substituting Eqs. (B4| and (B5) into each integral 

of F _( P ) , in sequence from t/™' to 77W (taking the ex- 

® v =i r i 

ception into account) and collecting all the inner (outer) 
parts of all sub-integrals accordingly we obtain 



(B7) 



where 



n r ( n <p+1) ) ip 

n / n/^ (p) r /? v p) , 
p=i j ° i=i 



(B8)

and



N e + 1 



j< p+1) -(€ P ei) 



n e 

p=1 j1 p) =« p j1 p+1) +(? p ®i) ES =1 «»ffii: 



Mifr^V), (B9) 



with X){5 fc =o summing over all possible nested inte- 
gration (sum) configurations £„ . . . £2 £1 with = for 
the inner part ^ (the outer part <i>| ut ^ ) . 



Note that in Eq. (B9) for $\xi_p = 1$



i< p+1) -(iei) 



E 



E 



(BIO) 



which means that the variables $\eta^{(p)}$ and $\eta^{(p+1)}$ in Eq. (21)
are in the same interval, $j_\ell^{(p)} = j_\ell^{(p+1)}$, while for $\xi_p = 0$



i< p+1) -(oei) 



E E ( B11 ) 

ji p) =o+(Offii)ES=i(e*ffii) j1 p) =E£ =1 (6=ei) 



see that each configuration f„ . . . £1 defines a way to 
separate the n vectors {ff }™ =1 into several clusters 
with order configuration numbers and error vectors as 



(») 



£iff • A cluster of vectors is defined as a con- 



tiguous set of vectors only connected by configuration 
numbers whose value is 1. Different clusters are sepa- 
rated by configuration numbers whose value is 0. For 
£ 8 • • • £1 = 01101011 in Eqs. ( |B12[ ) and ( |B13[ ) as an ex- 
ample, we have 

ff Iff) Iff ff } Iff ff Iff Iff (B14) 

which divides the 8 vectors into three clusters. Sup- 
pose for a given configuration £ tt . . . £2 £1 , which separates 
the n vectors into m clusters, the a th cluster (count- 
ing from right to left) contains n a vectors {rj }p— v 
with n n = [i — v + 1. Then, according to the result 



of 



/GO 



p—V 



its corresponding inner part is F^) with 



Ap) 



(a) 



7p=v' l- 



j and the corresponding outer part is 



with r 



(a) 



Ap) 



• t-x 

that for the last cluster, $a = m$, the upper bound of the
sum, $j^{(m+1)} - 1$, is replaced by $N_\ell + 1$. Therefore, Eq. (B7)
can be re-expressed in the more compact form of Eq. (53).



In the latter case, the variable $\eta^{(p)}$ in Eq. (21) is located
in an earlier interval than the variable $\eta^{(p+1)}$, namely,
$j_\ell^{(p)} < j_\ell^{(p+1)}$. The sum $\sum_{k=1}^{p} (\xi_k \oplus 1)$ counts the number of $\xi_k = 0$
from $k = 1$ to $k = p$.

Furthermore, each $\Phi^{\rm in}_{\xi_n \ldots \xi_1}(\{\vec{r}_{\ell-1}^{(p)}\}_{p=1}^{n})$ is a product of
multiple nested integrals with the modulation functions
of $\ell - 1$ layers of NUDD as integrands. In fact, each nested
integral contained in $\Phi^{\rm in}_{\xi_n \ldots \xi_1}(\{\vec{r}_{\ell-1}^{(p)}\}_{p=1}^{n})$ is actually one
of the $(\ell - 1)$-layer NUDD coefficients. For example, an
inner term appearing in the expansion of the 8th-order
$\ell$-layer NUDD coefficients $F_{\oplus_{p=1}^{8} \vec{r}_\ell^{(p)}}$ reads

^01101011 ( {ff-llp=l ) — ^0 S _ r (p) ^0 5 _ r <p) ^fo 3 i? <p) 



p=6' £— 1 ^p=4'^-l ^ D p = l' £-1 

(B12) 



F (3 2 F^} 



' t— 1 1 



where 



- m8 Ap) 



'p=i^_i- Its corresponding outer part reads 

iVe+l 

^rUon({rf }« =1 )= £ W?)*~ r < - 



— o\5 



/GO 



and 



j S '=3 



7 (8) -l 



, <5) -l 



E 6 r^ r -?« >< E mr Ur < s) 

a — "* j 1 a — "* 



i< 5) =2 



if =1 



(B13) 



where we used Eq. ( 14 ) 



As suggested by Eqs. (B12) and (B13), one can 



Tt-x 



Appendix C: F~ in terms of F 1 '" ' 



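We prove Lemma 1: $N_{\vec{r}_{\ell-1}} \le \sum_{a=1}^{m} N_{\vec{r}_{\ell-1}^{(a)}}$, where
$\vec{r}_{\ell-1} = \oplus_{a=1}^{m} \vec{r}_{\ell-1}^{(a)}$.

Proof of Lemma 1. For a given $\vec{r}_{\ell-1}$-type error with
$\vec{r}_{\ell-1} = \oplus_{a=1}^{m} \vec{r}_{\ell-1}^{(a)}$ [Eq. (55a)], its decoupling order is
[Eq. (29)]

$$N_{\vec{r}_{\ell-1}} = \max_{i \in \{1,\ldots,\ell-1\}}\left[r_i\,(p_\oplus(1, i-1) \oplus 1)\,\tilde{N}_i + p_+(i+1, \ell-1)\right]. \tag{C1}$$

Suppose the maximum occurs at the $M$th UDD layer,
i.e.,

$$N_{\vec{r}_{\ell-1}} = \tilde{N}_M + p_+(M+1, \ell-1), \tag{C2}$$

where the coefficient of $\tilde{N}_M$ is equal to 1, implying that

$$r_M = 1, \tag{C3a}$$
$$p_\oplus(1, M-1) = 0. \tag{C3b}$$

With Eq. (55a), Eq. (C3a), and Eq. (C3b), we have the
following equalities and inequality. First,

$$r_M = \oplus_{a=1}^{m} r_M^{(a)} = 1, \tag{C4}$$

second,

$$p_\oplus(1, M-1) = \oplus_{k=1}^{M-1} r_k [N_k]_2 = \oplus_{k=1}^{M-1} \left(\oplus_{a=1}^{m} r_k^{(a)}\right) [N_k]_2 = \oplus_{a=1}^{m} \left(\oplus_{k=1}^{M-1} r_k^{(a)} [N_k]_2\right) = \oplus_{a=1}^{m} |\nu_M^{(a)}|, \tag{C5}$$

where to keep the notation more compact we defined

$$|\nu_M^{(a)}| \equiv \oplus_{k=1}^{M-1} r_k^{(a)} [N_k]_2. \tag{C6}$$

Thus, using Eq. (C3b),

$$\oplus_{a=1}^{m} |\nu_M^{(a)}| = 0, \tag{C7}$$

and third,

$$p_+(M+1, \ell-1) = \sum_{k=M+1}^{\ell-1} r_k [N_k]_2 = \sum_{k=M+1}^{\ell-1} \left(\oplus_{a=1}^{m} r_k^{(a)}\right) [N_k]_2 \le \sum_{k=M+1}^{\ell-1} \sum_{a=1}^{m} r_k^{(a)} [N_k]_2 = \sum_{a=1}^{m} \sum_{k=M+1}^{\ell-1} r_k^{(a)} [N_k]_2. \tag{C8}$$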



By using the properties $\max[A, B, C] \ge \max[A, B]$ and
$\max[A + c, B + c] = \max[A, B] + c$ we have



E^> 



a=l 



Emax[{ 



. ( a ) n,A a ) i 



m+ e rriwti 



a=l 



>E™x[{r< a) (|^|®l)^ + E r^W.h}^} 

0=1 fe=i+l 
m A/ 

>Emax[{r< a> (|^|®l)^+ E ^[W^] 

a=l 

+ E E rt } lN k h 



k=i+l 



a=l fc=A/+l 



(C9) 



Equivalently,



which will immediately lead to Lemma 



The condition 



» 



that there must exist at least one a' such that ri^ 



1. For this vector with r 
components either satisfy \v^ a 



r M = 1 [Eq. (|C4|] implies 
one 

(a') 



A I 



1, its first M - 1 



A/ 



or 1^ 



<o'>i 

M I 



For the -type error with ^ 



1. 

>' 



the coefficient r 



(°'>(|j,<°'>| 



1 and \u£'\ = 0, 



is not zero. Therefore, 

N p(a , } =m ax [{---}f^ 1 ,N M ]>N 



1) of N M in [Eq. (|29j] 

w (C12) 



which leads to Eq. (Cll ) 



However, for the rff-type error with 



(a') 



1, the coefficient rffl |©1) of iV M in Eq. (l29|) 



<a'> 



= 1 and 



ends up vanishing, leading to 

iV> =max[{...}flr\0] 



(C13) 



which cannot determine whether N^'y > -^m- This 
notwithstanding, ^ = ©j^Vi" ^ [A^k = 1 indicates 



»fej2 

that among the M — 1 components of the vector fj^ ^ , 
there is a total odd number of components each of which 
is equal to 1, and associates with an odd sequence or- 
der. Suppose the o' component is the outermost nonzero 
component r > — 1 with [iV,y] 2 = 1 of . Accordingly, 



A/ 

E '< 

fe=o' + l 



[^fe] 2 = 



(C14a) 
(C14b) 



With Eq. (C14a| and Eq. (C14b), it follows that 



= max[{- • • }? = 1\ JVc, {• • • > iV , > 7V M -1 

' M 

where the last inequality is derived as follows: 

N M = rnmLV M , A^< n M + 1] = min[Ar M , N k D <( + 1] 

< N*<°' + 1 = min^"'- 1 + 1, N a > + 1] 

< A^ + 1 (C16) 



E ^-i " 51 ^> + E E r fe° > [ iV '=]2' ( C1 °) there is another vector r<°"> such that 



Moreover, if ' = 1, Eq. (C7) implies that 

>">| = 



a=l k=M+l 



M-l(a") 



In light of the inequality ( C8 1 , the remaining task is to 
show 



E^v> — 

a=l 



(Cll) 



M UlldU 

[Afc] 2 = 1. Using the same argument, there 

must exist a r c » = 1 with v^, — with an odd sequence 
order N a », o" < M. It follows that 

N ^ = max[{- • • }°^\ N a „, {■ ■ ■ }?l „ +1 } > N Q „ > 1. 

' M 

(C17)

Therefore, with Eqs. (C15) and (C17),

m 

V JV^.) > N {a , } + N p (a") > N M 



0=1 



which proves Eq. (C11).



(C18) 



□ 



Appendix D: Fourier expansions after linear change
of variable



The $\ell$th-layer modulation function $f_\ell(\theta)$, which alternates
in sign with the new outer-most-layer pulse timing
$\theta_{j_\ell} = \frac{j_\ell \pi}{N_\ell + 1}$, has a period of $\frac{2\pi}{N_\ell + 1}$ since the
outer-most-layer pulses are now equally spaced. Therefore, $f_\ell(\theta)$ has
a simple Fourier expansion

$$f_\ell(\theta) = \sum_{k=0}^{\infty} d_k^{\ell} \sin[(2k+1)(N_\ell + 1)\theta], \tag{D1}$$

with $d_k^{\ell} = \frac{4}{(2k+1)\pi}$. The remaining modulation functions,
such as $f_i(\theta)$ with $i < \ell$, which switch sign with the
inner-layer UDD pulse timing $\theta_{j_\ell, \ldots, j_i}$, all have the same
period $\frac{\pi}{N_\ell + 1}$. This is because the inner $(\ell - 1)$-layer
NUDD pulse timing structure inside each interval $[\theta_{j_\ell - 1}, \theta_{j_\ell})$ is
still preserved, and these structures are identical to each
other, in that Eq. (61) just rescales the outer-most layer
interval $[\theta_{j_\ell - 1}, \theta_{j_\ell})$ linearly. In addition, due to the
time-symmetric structure of NUDD pulse timings, it follows
that inside each $[\theta_{j_\ell - 1}, \theta_{j_\ell})$, $f_i(\theta)$ for $i < \ell$ is an even
function when $N_i$ is even, and odd when $N_i$ is odd. Therefore,
the Fourier expansion of $f_i(\theta)$ has the following form:

$$f_i(\theta) = \begin{cases} \sum_{k=0}^{\infty} d_k^{i} \cos[2k(N_\ell + 1)\theta], & N_i \text{ even} \\ \sum_{k=1}^{\infty} d_k^{i} \sin[2k(N_\ell + 1)\theta], & N_i \text{ odd} \end{cases} \tag{D2}$$

for $i < \ell$. Note that the Fourier expansion coefficients $d_k^{i}$
in the even and odd cases are, in fact, different. However,
we use the same notation for both since the exact values
of these coefficients are irrelevant for our proof. Finally,
the Fourier expansion of $G_1(\theta)$ is

$$G_1(\theta) = \sum_{k=0}^{\infty} \sum_{q=-1,1} g_{k,q} \sin[2k(N_\ell + 1)\theta + q\theta], \tag{D3}$$

whose detailed calculations can be found in Ref. [39]. In
accordance with Definition 1, the function types of $f_\ell(\theta)$,
$f_{i<\ell}(\theta)$, and $G_1(\theta)$ [Eqs. (D1)-(D3)] are identified as Eqs.
(66)-(68), respectively.
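
As a quick numerical sanity check of the outer-layer expansion, the sketch below sums a truncated series of the form (D1). It assumes the standard unit square-wave coefficients $4/[(2k+1)\pi]$ and equally spaced switching times $\theta_j = j\pi/(N_\ell + 1)$; the function name `f_outer_partial`, the value `n_outer = 3`, and the truncation at 2000 terms are choices made only for this illustration.

```python
import math

def f_outer_partial(theta, n_outer, terms):
    """Truncated Fourier series of a unit square wave that flips sign
    at theta_j = j * pi / (n_outer + 1)."""
    m = n_outer + 1
    return sum(4.0 / ((2 * k + 1) * math.pi) * math.sin((2 * k + 1) * m * theta)
               for k in range(terms))

n_outer = 3
for j in range(1, n_outer + 2):
    # Midpoint of the j-th outer-most interval [theta_{j-1}, theta_j).
    theta_mid = (j - 0.5) * math.pi / (n_outer + 1)
    print(j, round(f_outer_partial(theta_mid, n_outer, 2000), 3))
```

At the interval midpoints the partial sums approach $(-1)^{j-1}$, i.e., the modulation function alternates in sign from one outer-most interval to the next.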



Appendix E: $\min[\{N_{\vec{r}_{\ell-1} \in v^1_{\ell-1}}\}] = N^{k'<\ell}_{o_{\min}}$



We prove Lemma 4: $\min[\{N_{\vec{r}_{\ell-1} \in v^1_{\ell-1}}\}] = N^{k'<\ell}_{o_{\min}}$.

Proof of Lemma 4. For all $\vec{r}_{\ell-1} \in v^1_{\ell-1}$, their components
satisfy, by definition, the condition $p_\oplus(1, \ell-1) = 1$, which
implies that the total number of non-zero components
with odd sequence orders in $\vec{r}_{\ell-1}$ is odd. Suppose that
before the $\ell$th layer, the $o_1$th, $o_2$th, ..., and $o_j$th UDD layers,
where $0 < o_1 < o_2 < \cdots < o_j < \ell$, are the layers with odd
sequence orders, namely, $[N_{o_i}]_2 = 1$ for $i = 1, 2, \ldots, j$.

Define $\vec{e}_{o_i}$ as the $\vec{r}_{\ell-1}$-type errors with $r_{o_i} = 1$ and all
$r_{j \ne o_i} = 0$. For $\vec{e}_{o_i}$-type errors, which obviously belong to
the $v^1_{\ell-1}$-type errors, we have $N_{\vec{e}_{o_i}} = \tilde{N}_{o_i}$ according to the
decoupling order formula Eq. (29).

The remaining $v^1_{\ell-1}$ vectors have 3 or more
non-zero components associated with odd orders. For a
given $\vec{r}_{\ell-1} \ne \vec{e}_{o_i} \in v^1_{\ell-1}$, suppose the inner-most component
with odd order is $r_{o_i} = 1$. Then its decoupling order
follows

$$N_{\vec{r}_{\ell-1} \ne \vec{e}_{o_i} \in v^1_{\ell-1}} = \max\left[\ldots,\; \tilde{N}_{o_i} + \sum_{k=o_i+1}^{\ell-1} r_k [N_k]_2,\; \ldots\right] > \tilde{N}_{o_i} = N_{\vec{e}_{o_i}}. \tag{E1}$$

Therefore, $\min[\{N_{\vec{r}_{\ell-1} \in v^1_{\ell-1}}\}]$ will occur among the
decoupling orders of the errors which have only one non-zero
component with odd sequence order, i.e.,

$$\min[\{N_{\vec{r}_{\ell-1} \in v^1_{\ell-1}}\}] = \min[N_{\vec{e}_{o_1}}, N_{\vec{e}_{o_2}}, \ldots, N_{\vec{e}_{o_j}}] = \min[\tilde{N}_{o_1}, \tilde{N}_{o_2}, \ldots, \tilde{N}_{o_j}] = \min[N_{o_1}, N_{o_2}, \ldots, N_{o_j}]. \tag{E2}$$

Eq. (E2) is the same expression as $N^{k'<\ell}_{o_{\min}} =
\min\{N_{k'} \mid [N_{k'}]_2 = 1\}$ [Eq. (31)], which completes the
proof of Lemma 4.

□
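
The statement of this lemma can also be checked by brute force for small cases. The sketch below is an illustration rather than part of the proof: it assumes the same reading of Eq. (29) as in Eq. (C1), with $\tilde{N}_i = \min[N_i, N^{k<i}_{o_{\min}} + 1]$, and the names `order_bound` and `lemma4_holds` are hypothetical. The tuple `N` lists the sequence orders of the inner layers, inner-most first.

```python
from itertools import product

def order_bound(N, r):
    """Assumed decoupling-order formula for error type r on layers N."""
    odd = [ri * (n % 2) for ri, n in zip(r, N)]  # r_k [N_k]_2
    best = 0
    for i, n in enumerate(N):
        inner_odd = [m for m in N[:i] if m % 2]
        n_eff = min(n, min(inner_odd) + 1) if inner_odd else n
        active = 1 - sum(odd[:i]) % 2            # p_xor(1, i-1) XOR 1
        best = max(best, r[i] * active * n_eff + sum(odd[i + 1:]))
    return best

def lemma4_holds(N):
    """Check that the minimum over error types with odd parity equals
    the smallest odd sequence order."""
    odd_orders = [n for n in N if n % 2]
    if not odd_orders:
        return True  # no odd layers: the set of odd-parity vectors is empty
    v1 = [r for r in product((0, 1), repeat=len(N))
          if sum(ri * (n % 2) for ri, n in zip(r, N)) % 2 == 1]
    return min(order_bound(N, r) for r in v1) == min(odd_orders)

# Exhaustive check for three inner layers with orders between 1 and 5.
print(all(lemma4_holds(N) for N in product(range(1, 6), repeat=3)))
```

A check of this kind only covers the enumerated cases; the general statement rests on the argument leading to Eq. (E2).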



